r 

AD-A063  181 

UNCLASSIFIED 

POLYTECHNIC  INST  OF  NEW  YORK  BRC 
PROGRESS  REPORT  NUMBER  43  TO  THE 
NOV  78  A A OLINER 

POLY-MRI-452. 43-78 

>OKLYN  K 
JOINT  S 

IICROWAVE  RESE-- ETC  F/S  9/3 
.ERVICES  TECHNICAL  ADVIS— ETC<U) 
F44620-78-C-0074 

NL 

I 

i 

1 o'  6 

AO  A 
063181 

P« 

-v  - - 

r BK. 

<■  -wgi 

-jr  — 

A 

•wwrfC  V 

* 

9 • J 

\ 

*r 

1 

| 

Progress  Report  No.  43 

15  September  1977  through  14  September  1 978 


The  Joint  Services 
Technical  Advisory  Committee 


Representing:  The  Air  Force  Office  of  Scientific  Research 
The  U.S.  Army  Research  Office^ 

The  Office  of  Naval  Research  \ \ 


Submitted  by 


Arthur  A.  Oliner,  Director 
Microwave  Research  Institute 
Jean  Beirne  Maher  A. 


Coordinated  by 


for 

Contract  F44620-78-C-0074 
Project  No.  4751 
This  document  has  been 
approved  for  public  release  and 
sale;  its  distribution  is  unlimited. 


Report 
R-452. 43-78 


Submitted 

November 

1978 


Polytechnic 


Polytechnic  Institute  of  New  York 
333  Jay  Street,  Brooklyn,  New  York  11201 


Qualified  requestors  may  obtain  additional  copies  from  the  Defense  Documentation  Center;  all  others  should  apply  to 
the  Clearinghouse  for  Federal  Scientific  and  Technical  I nformation 


Js 


7o  f>-»  12  03 


Progress  Report  No.  43 

15  September  1977  through  14  September  1978 


The  Joint  Services  * 

Technical  Advisory  Committee 

Representing:  The  Air  Force  Office  of  Scientific  Research 
The  U.S.  Army  Research  Office 
The  Office  of  Naval  Research 


Submitted  by:  Arthur  A.  Oliner,  Director 

Microwave  Research  Institute 
Coordinated  by:  Jean  Beime  Maher 


Report 

R-452.43-78 

Submitted 

November 

1978 


for 

Contract  F44620-78-C-0074 
Project  No.  4751 
This  document  has  been 
approved  for  public  release  and 
sale;  its  distribution  is  unlimited. 


Polytechnic 

Polytechnic  Institute  of  New  York 
333  Jay  Street,  Brooklyn,  New  York  11201 

Qualified  requestors  may  obtain  additional  copies  from  the  Defense  Documentation  Center;  all  others  should  apply  to 
the  Clearinghouse  for  Federal  Scientific  and  Technical  I nformation 


PREFACE 

This  report  to  the  Joint  Services  Technical  Advisory  Committee  presents  a 
summary  of  the  research  programs  in  the  broad  field  of  electronics  conducted  during 
the  past  year  at  the  Polytechnic  Institute  of  New  York.  These  programs  are  pursued 
within  the  framework  of  the  Microwave  Research  Institute,  and  they  involve  the 
academic  research  activities  of  faculty  in  the  departments  of  Electrical  Engineering, 
Physics  and  Chemistry.  The  research  projects  cover  a broad  spectrum  ranging  from 
basic  theoretical  investigations  in  physics,  applied  mathematics,  and  engineering  to 
experimental  efforts  involving  basic  measurements  and  the  development  of  devices  and 
materials . 

The  format  of  this  annual  report  permits  a coherent  presentation  of  the  various 
phases  of  the  Joint  Services  Electronics  Program  (JSEP)  at  the  Polytechnic  and 
their  relation  to  ongoing  research  in  electronics  sponsored  by  other  agencies.  This 
presentation  is  intended  for  the  information  of  the  Air  Force  Office  of  Scientific 
Research,  the  Army  Research  Office  and  the  Office  of  Naval  Research  and,  in 
addition,  the  other  sponsors  who  are  individually  acknowledged  throughout  the  report. 
The  principal  aims  of  the  JSEP  are  to  initiate  deserving  lines  of  research  in  a timely 
fashion  and  to  develop  investigations  to  a stature  sufficient  to  attract  individual 
support  on  their  own  merits. 

In  the  early  days  of  the  Microwave  Research  Institute,  the  research  program 
consisted  primarily  of  projects  involving  electromagnetics  and  microwave  components. 
Although  the  name  of  the  Institute  has  remained  the  same,  the  nature  of  the  research 
programs  has  broadened  substantially,  and  the  programs  now  encompass  a wide 
range  of  topics  within  the  field  of  electronics.  The  current  programs  are  organized 
into  twelve  areas:  electromagnetics;  acoustics;  optics;  quantum  electronics;  solid 
state  and  materials;  wave-matter  interactions;  electric  power  engineering;  communi- 
cations; computers  and  computer-communication  networks;  safety,  reliability  and 
software  engineering;  systems,  control  and  networks;  and  image  processing. 


Arthur  A.  Oliner 
Director 
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ELECTROMAGNETICS 


HIGH  FREQUENCY  FIELDS  EXCITED  BY  A LINE  SOURCE  LOCATED  ON  A 
CONCAVE  CYLINDRICAL  IMPEDANCE  SURFACE 

T.  Ishihara  and  L.  B.  Felsen 

A.  Introduction 

When  high-frequency  fields  impinge  on  a concave  surface  with  large  radius  of 
curvature,  the  induced  fields  thereon  differ  markedly  from  those  for  the  well  explored 
convex  case.  A recent  study*  has  dealt  in  detail  with  the  two-dimensional  problem 
posed  by  line  source  excitation  on  the  interior  boundary  of  a perfectly  conducting  circu- 
lar cylinder.  Alternative  field  representations,  accounting  only  for  propagation  phe- 
nomena on  the  surface  segment  between  source  and  observation  points,  were  formulated 
and  evaluated  asymptotically.  The  field  expressions  so  obtained  were  interpretable 
physically  in  terms  of  constituents  involving  whispering  gallery  modes,  ray-optical 
fields,  near  field  effects,  continuous  spectrum  and  canonical  integral  contributions. 
Extensive  numerical  comparisons  showed  the  accuracy  and  range  of  applicability  of 
each.  This  investigation  has  provided  fundamental  physical  and  quantitative  insight 
into  the  field  behavior  on  the  perfectly  conducting  boundary. 

Here  the  analysis  is  extended  to  accommodate  surfaces  with  non-vanishing 

surface  impedance  Zg.  Because  the  alternative  field  representations  (Section  B)  for 

Z ^ 0 are  derived  by  the  same  techniques  as  those  for  Z =0,  the  presentation  is 
s s j 

kept  concise,  with  frequent  reference  made  to  the  earlier  study  for  some  of  the  de- 
tails. Representative  numerical  calculations  in  Section  C show  how  increasing  surface 
impedance  affects  the  surface  field.  When  losses  are  appreciable,  the  closely  bound 
whispering  gallery  modes  are  rapidly  attenuated,  leaving  a properly  formulated  ray- 
optical  field  as  the  dominant  and  adequate  contributor. 

B.  Alternative  Representations 
1 . Basic  Green's  Function 

The  relevant  magnetic  line  source  Green's  function  for  the  interior  of  a circular 

cylinder  with  radius  a and  constant  surface  impedance  Z is  formulated  in  an  infinite 

s 2 

angular  space  wherein  the  azimuthal  coordinate  <j>  ranges  from  -oo  to  +°c.  With  the 
source  point  located  at  ( p* , <j>’)  in  a cylindrical  ( p,  <j>)  coordinate  system,  the  Green's 
function  satisfies  the  inhomogeneous  wave  equation 

P#p  + -T  7T  +fc2>G(£>  £’>  = 6<<M>’)  6(p-p')  (l) 

P o<)> 

with  an  "angular  radiation  condition"  in  the  <j>  direction,  and  the  radial  boundary 
condition 
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■2—  G = ikZ 1 G at  p = a ; G finite  at  p = 0 (la) 

OP 

Here  Z'  = Zs/Zo  is  a normalized  surface  impedance,  and  Zq  is  the  impedance  of  free  space. 
A time  factor  exp(-iwt)  is  suppressed. 

Expressing  the  Green's  function  G as  a contour  integral  and  removing  diffraction 
effects  at  the  origin  arising  from  the  "angularly  matched"  condition,  one  obtains  a 
modified  form  that  contains  only  the  essential  propagation  phenomena  from  the  source 
point  to  the  observation  point  in  the  presence  of  a cylindrical  boundary  segment.  When 
the  source  and  observation  points  are  both  located  on  the  impedance  boundary  (i.e., 
p = p1  = a),  the  modified  Green's  function  G becomes 

G = — L_  / dv  (2) 

i(TTka)2  C [J'v(ka).iZ'Jv(ka)]  [H’^ka)  - iZ'H^(ka)  ] 

where  k is  the  free-space  wavenumber,  and  the  prime  on  the  cylinder  functions  denotes 
the  derivative  with  respect  to  the  argument.  The  contour  C and  the  singularities  of 
the  integrand  in  the  complex  v-plane  are  shown  in  Figure  1.  In  the  lossless  limit 
Z!  = 0,  this  formulation  reduces  to  the  one  in  Reference  1. 


V-  Plans 


Fig.  1.  Integration  path  and  singularities  in  complex  v-plane. 

xx--zeros  v^  of  [ H^^(ka)  - iZ'H^(ka)];  ••--zeros 

v of  [ J'(ka)  - iZ'J  (ka)]  . 
m 1 v "v  J 

Anticipating  subsequent  contour  deformation  in  the  complex  v-plane,  we  examine 
the  location  of  the  pole  singularities  vm  determined  by  the  resonance  equation 

J'  (ka)  - iZ'J  (ka)  = 0,  m = 1 , 2,  . . . , Mj , Re  vm  > 0 (3) 

Vm  Vm 
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These  poles  in  the  integrand  of  Eq.  (2)  describe  whispering  gallery  modes.  For  the 
special  cases  arg  Z1  = ±90°,  the  roots  Vm  are  real.  If  for  arg  Z'  = -90°,  the  magni- 
tude |Z'|  of  the  normalized  impedance  is  greater  than  a certain  value,  one  finds  for 
the  first  root  m = 1 that  > ka.  The  corresponding  field  contribution  then  describes 
a surface  wave  that  can  exist  also  on  a plane  boundary.  It  is  interesting  to  note  that 
all  of  the  whispering  gallery  modes  represent  fast  waves  when  the  boundary  is  perfectly 
conducting.^  However,  for  Z1^  0,  there  may  exist  slow-wave  type  whispering  gallery 

modes  with  Re  v > ka,  which  become  surface  waves  when  Im  v = 0,  in  addition, 
m m 

The  resonance  equation  for  the  first  M whispering  gallery  modes  bound  most  closely 
to  the  impedance  surface  is  found  to  be 

l 

M'(t  ) + id  Ai(t  ) = 0,  m=l,2,  ...,M  , a = Z'(^)  3 (4a) 

m m c. 

where  the  t are  solutions  o£  the  differential  equation 
m 


t = -ia 


In  the  other  domain,  where  | v - ka  | > 0(|v|  ^ ),  one  may  employ  the  Debye  approxi- 
mations * to  obtain  the  resonance  equation 

cos  w cos  [t(w  ) + “ ] - iZ  'sm[£(w  )+•?]  = 0,  m = M + 1 , . ■ . M.  (5) 

m m'  4 J nr  4 J ’ ’ 1 ' ' 


where 


£(w)  = ka  [cos  w - (-jr  - w)  sin  w]  , v = ka  sin  w,  Re  w > 0 


It  is  found  that  the  two  appr oxi mations  in  Eqs,  (4a)  and  (5)  have  an  overlapping  domain 
wherein  one  may  switch  from  one  to  the  other.  For  the  case  in  Fig.  2,  M = 4 has  been 
found  satisfactory. 

2.  Whispering  Gallery  (W.G.)  Mode  and  Continuous  Spectrum  Representation 

When  the  contour  C in  Fig.  1 is  deformed  into  a contour  extending  along  the 
imaginary  v-axis,  the  Green's  function  can  be  represented  as  a sum  of  whispering 
gallery  (W.G.)  modes  Gm  and  a continuous  spectrum  : 


G = Tj  G 


+ Rx. 

m M . 


r 


Fig.  2.  Zeros  of  [J'v(ka)  -iZ'Jv(ka)]  in  complex 
w-plane.  ••--zeros  for  Z'  = 0 (perfect 
conductor);  oo--zeros  for  Z'=0.05  exp(-i30.5°) 

(small  loss);  ■■ — , 4A— zeros  for  Z'=0.22 
exp(-i30.  5°)  (large  loss).  The  first  five  roots 
are  obtained  from  (5b)  and  compared 
with  ■■--  obtained  from  (7). 

where 

Jv  (ka)  exp[ivm|4>-<|),|  + iTT/2] 

G = —51 (7) 

m ka(d/8v)  {J'  (ka)-iZ'Jv(ka)} 

vm 

with  the  v satisfying  the  resonance  equation  in  (3).  Using  the  Fock  type  transition 
m j 

region  approximations,  one  finds 


(8) 


(9) 


Here,  s denotes  the  path  length  between  source  and  observation  point  along  the  boundary, 
and  Y is  an  arc  length  parameter.  For  the  remaining  modes,  Debye  approximations 
and  Eq.  (5)  may  be  used  to  obtain 
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G 

m 


exp(iks  sin  wm  + irr/2 


ka(rr/2-w  ) cos  w [l  - (Z'/cosw  ) ] 


’27"  * tn  = M + 1 ...  Mj 


(10) 


From  the  behavior  of  the  roots  as  discussed  in  Section  1,  one  can  see  that  the  residue 
contribution  arising  from  the  m = 1 pole  singularity  at  tj  {the  first  root  of  Eq.  (4)) 
describes  either  a whispering  gallery  mode  bound  most  closely  to  the  impedance 
boundary  or  a surface  wave  mode  when  Re  tj  > 0,  Imtj  = 0.  In  the  latter  instance, 

G j in  Eq.  (8)  becomes  a surface  wave  field. 

As  in  the  lossless  case,  the  continuous  spectrum  R^  can  be  approximated  by**  4 


00 

1_  r exp(-ks  sinhv)  ^ 
11  o l-(Z'/coshv)^ 


(ID 


3.  Ray-Optical  Plus  Slow  Wave  Representation 

With  a view  toward  a ny-optical  representation,  it  is  convenient  to  deform  the 
contour  C in  Fig.  1 into  C,  whence  G becomes 

G-e  + GjUttj)  (12) 

where  G is  the  same  as  G except  for  the  replacement  of  C by  C in  Eq.  (2),  and  Gj  is 
the  contribution  (if  any)  from  the  pole  singularity  at  tj  located  between  C and  "C.  The 
unit  step  function  U(tj)  contributes  only  when  the  tj  pole  lies  between  the  two  contours. 
The  extraction  of  Gj,  which  may  be  either  a slow  wave  W.  G.  mode  or  a surface  wave, 
is  desirable  since  slow  wave  fields  do  not  exhibit  ray-optical  properties.  Thereafter, 
the  Bessel  function  denominator  in  Eq.  (2)  is  expanded  into  a "traveling  wave"  series, 
Debye  approximations  are  employed,  and  the  resulting  integrals  are  evaluated  along 
steepest  descent  paths  passing  through  saddle  points  vgn  in  the  complex  v-plane 
(Fig.  3).  The  result  is 


oo 

s = v € 

lj  n 


n=0 

where 


<3  - 

n 


(-i) 


n 


(T  )ncos^w  exp(ikD  ) 

n*  an  n 

(cos  wgn  + 


(13) 


(14) 
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Fig.  3.  Steepest  descent  paths  in  complex  v -plane , 
Dn  = 2(n+  1)  a sin  [|<M»'|/2(n+  1)]  , w3n  = J 

and  Tn  is  the  boundary  reflection  coefficient  for  ray  species  n, 

cosw  - Z 1 

j.  _ sn 

1n  cos  w +? 1 
sn 


(14a) 


(14b) 


The  expression  in  Eq.  (14)  corresponds  to  a geometric-optical  ray  field  that  has  under- 
gone n reflections  at  the  impedance  boundary  (see  Fig.  4).  The  sum  in  Eq.  (13)  can 


Fig.  4.  Direct  and  multiply  reflected  rays  for  a circular 
boundary.  Geometrical  quantities  Dq,  Dp  D2, 
wao»  wsl  and  are  also  shown. 
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actually  not  be  extended  to  n -*  o'  since  the  Debye  approximations  used  to  obtain  Eq.  (14) 
become  invalid  as  wgn  -*  Tt/2.  This  situation  arises  either  when  the  observation  point 
approaches  the  source  point  so  that  |<()-<|>,|  -»  0,  or  when  the  number  of  rays  increases, 
with  | I fixed.  The  latter  case  can  be  avoided  by  truncating  the  number  of  rays  at  some 
n = N such  that  wgN  is  sufficiently  less  than  tt/2  (see  Section  4)  while  the  former  case 
will  be  discussed  in  Section  6. 

4.  Ray  Plus  Canonical  Integral  Plus  Slow  Wave  Representation 

Truncating  the  ray  series  leads  to  the  representation 
N 

G = S fin  + *N  + C1  u(tl>  (15) 

n=0 

where 


= 


1 

i(rrka)^ 


/ __  

Z CH^^ka)  - iZ'H(vZ,(ka)][j^(ka)  -iZ'Jv(ka)] 


dv 


(15a) 


£n»  given  in  Eq.  (17),  again  describes  a geometrical  ray  field  having  undergone  n re- 
flections. The  remainder  term  RN  accounts  for  ray  fields  reflected  more  than  N times. 

One  way  of  dealing  with  Eq.  (15a)  is  to  investigate  how  fast  the  integrand  decays  in 
the  neighborhood  of  the  point  v = ka:  rapid  decay  permits  effective  truncation  of  the 
integration  path  and  the  use  of  further  approximations.  Proceeding  as  in  the  Z'  = 0 
case,  one  may  show  that  the  main  contribution  to  the  integral  arises  from  the  neighbor- 
hood of  v = ka  provided  that  the  arc  length  parameter  y defined  in  Eq.  (9)  and  the 
number  of  rays  (N+l)  satisfy  the  left-hand  portion  of  the  inequality 

p j-  ■ 1 < (N  + 1)  < r , (16) 

2(2-  A?  2(2IA)1 

where  A = A(ka,  Z ) = 0(1)  and  | Z * | « 1.  The  right -ha.id  portion  of  the  inequality 
validates  use  of  the  Debye  approximations  which  have  been  used  to  derive  5 in  Eq, 

(14).  The  complete  inequality  in  Eq.  (16)  provides  a criterion  for  the  number  of  geo- 
metric optical  ray  fields  that  are  identified  in  Eq.  (15).  The  precise  choice  of  A is 
not  too  important;  a value  A 2 has  been  found  adequate  in  our  various  numerical 
comparisons1  (Section  C). 

Since  the  contributing  portion  of  the  integrand  in  Eq.  (15a)  is  localized  near  v = kaf 
one  may  substitute  the  Fock  type  asymptotic  approximations  to  obtain 
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» - (ta,  t y Y , a) 

2in  ka 


*•(£> 3 


(17) 


Here,  Lj(Y»  h)  i»  the  canonical  integral 

°°*  ^ [(W|(t)  + iawj(t)  }/{w2(t)  + iaw2(t)}]N+1 

^ i4n/3  Cw2<*)  + i»w2(t)  ] [Ai(t)  + iaAi(t)] 


eiYtdt 


(18) 


«ee 


which  has  been  evaluated  for  relevant  ranges  of  Y and  three  values  of  a.  The  results 
are  shown  in  Fig.  11  and  have  been  tabulated.  5 The  canonical  integral  for  a related 
boundary  value  problem  corresponding  to  a different  integrand  and  to  a = 0 has  been 
tabulated  by  Babich  and  Buldyrev.  ^ 

5.  Ray  Plus  Whispering  Gallery  (W.G.)  Mode  Plus  Slow  Wave  Representation 

An  alternative  way  of  treating  Eq.  (15a)  is  to  deform  the  contour  Z into  CN  in 
Fig.  3 and  thereby  representing  R^  as  a sum  of  [M-U(tj)]  W.G.  modes  whose  poles 
lie  between  Z and  CN,  plus  some  other  remainder  term  RMN»  One  finds 


M 

l 

m=TR 


RN  " E Gm  + RMN 


m = 1 + U(tj) 


(19) 


where 


• “ T 6M(1 


RMN  • "7  WN' 


Z* 


cos  w 


(19a) 


sN 


Thus 


N 

° - E c„  ♦ 

n=0  m=m 


E Cm  ’ 2 ^N(1  "cosw 


(20) 


sN 


6.  Near  Field  Form 

As  observed  at  the  end  of  Section  3 (see  also  Eq.  (16)),  any  Green's  function 
representation  that  includes  geometric  optical  terms  fails  in  the  near  field  where  the 
arc  length  parameter  Y in  Eq.  (9)  is  less  than  2(21^3  A)1^2.  For  this  case,  one  may 
find  the  near  field  representation: 


G - 


Biks  + in/ 4 
V2nks 


21 

1 ♦ l o,  v/2  * 

j=l 


0(YU) 


M<  i 


(21) 
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where 


Jrf  a.  e 


V>  “ a 


i(^j)  / j/2 

I6  / W 

} ' m=l 


j odd 


(2m-l),  j even 


n 

The  coefficients  a^  have  been  given  elsewhere.  The  restriction  |a  | <1  in  Eq.  (21) 
arises  from  the  condition  | a Ai(t)/Ai'(t)  | < 1.  In  view  of  the  definition  of  a in  Eq.  (4a), 
the  solution  Eq.  (21)  can,  therefore,  not  be  applied  to  the  infinite  plane  limit  a if 
Z'  differs  from  zero. 

C,  Nume rical  Res ults 

Extensive  numerical  computations  have  been  performed  for  ka  = 100  and  for 
various  surface  impedance  values  to  show  how  the  amplitude  and  phase  of  the  surface 
impedance  affect  the  field,  and  to  check  the  accuracy  and  range  of  validity  of  the 
various  formulations  in  Section  B. 

Concerning  the  propagation  constants  for  the  W.  G.  modes,  we  have  obtained 
solutions  of  the  differential  equation  (5b)  by  the  Runge-Kutta  method.  The  results  are 
shown  in  Figure  5.  Here  tj,  t^  and  t^  are  the  first  three  roots  of  Ai'(tRi)  = 0 (i.e., 
a = 0 in  Eq.  (5a)  and  t , t?  and  t are  those  of  Ai(t  ) = 0 (i.e,,  a = oo  in  Eq.  (5a)). 

The  solid  lines  and  the  dotted  lines  are  equi-phase  and  equi-amplitude  curves,  respec- 
tively. The  arrows  on  the  equi-phase  curves  indicate  the  directions  along  the  root 
loci  with  increasing  amplitude  |£  | . It  is  interesting  to  note  in  Fig.  5 that  the  first  zero 
tj  crosses  the  imaginary  axis  for  some  values  of  £,  thereby,  furnishing  the  slow  wave 
type  of  whispering  gallery  mode.  In  particular,  when  Retj  > 0,  Im  tj  = 0,  the  contri- 
bution from  tj  describes  a surface  wave  mode.  Except  for  argZ'=  ± 90°,  the  imaginary 
parts  of  the  whispering  ga'lery  mode  poles  become  large  as  the  amplitude  of  Z'  increases 
within  the  ranges  under  consideration  (i.e.,  |Z'j  < 0.6).  It  should  be  noted  in  Figs, 

2 and  5 that,  for  given  Z1  and  arg  Z'^  i 90°,  the  eigenvalues  for  modes  bound  close  to 
the  surface  have  large  imaginary  parts  compared  to  those  located  far  from  the  surface, 
thereby  de -emphasizing  the  importance  of  the  most  closely  bound  whispering  gallery 
modes.  This  is  to  be  expected  on  physical  grounds  since  the  field  of  the  tightly  bound 
modes  is  in  close  proximity  to  the  lossy  guiding  surface.  It  can  also  be  seen  from 
Fig.  2 that  the  two  approximations  given  in  Eqs.  (5a)  and  (7)  have  an  overlapping 
region  wherein  one  can  switch  from  one  to  the  other,  fd  = 4 has  been  found  satisfactory 
for  our  evaluation  of  the  field. 
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Fig.  5.  Roots  of  [Ai'lt^  + idAHtjp)]  as  obtained  from  (5b). 

The  numbers  on  the  solid  and  dashed  curves  denote 
the  phase  and  amplitude  of  £ = -ici,  respectively. 

In  Figs.  6 and  7,  we  have  compared  the  various  formulations  in  Section  B for  a 
purely  inductive  impedance  case  (|Z'|  =0.22  and  arg  Z1  = -90°)  to  check  accuracy  and 
range  of  validity.  For  this  case,  the  surface  wave  mode  is  excited  since  the  first  root 
of  Eq.  (5a)  or  Eq.  (5b)  is  located  on  the  positive  real  axis  in  the  t-plane.  The  ray  plus 
canonical  integral  plus  slow  wave  representation  in  Eq.  (19)  is  seen  to  provide  an 
excellent  approximation  provided  that  the  number  of  rays  (N+ 1 ) is  chosen  according 
to  the  criterion  in  Eq.  (20).  Because  of  the  overlap  of  the  curves  for  various  N, 
switching  from  one  formulation  to  the  other  can  be  performed  smoothly.  The  ray  plus 
canonical  integral  contribution  curves  are  shown  separately  to  exhibit  the  effect  of  the 
surface  wave  on  the  total  field. 

In  Fig.  7,  the  near  field  form  in  Eq.  (30)  and  the  ray  plus  whispering  gallery 
mode  plus  slow  wave  representation  in  Eq.  (27)  are  compared  with  the  whispering 
gallery  mode  (including  surface  wave)  plus  continuous  spectrum  representation  in  Eq. 

(8).  It  is  interesting  to  observe  that  for  large  enough  Y,  the  total  field  can  be  accurately 
represented  just  by  the  surface  wave  (M=l)  and  by  rays  provided  that  the  appropriate 
number  of  reflected  rays  is  included  as  Y or  |4>-<j>'|  increases. 
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Fig.  6.  Ray  plus  canonical  integral  plus  slow  wave  repre- 
sentation in  Eq.  (19).  with  Z'  = 0.22  exp(-i90°). 
Both  the  Y and  coordinates  are  indicated. 

Also  shown  are  the  ranges  in  Y and  corres- 

ponding to  N=  -1  ( — • — • — •)  (no  geometric -optical 
ray);  N = 0 ( — • — ) (1  ray);  N = 1 ( — . . — ) (2  rays); 
N=  2 ( — . . . — ) (3  rays).  The  solid  curve  is  calcu- 
lated from  Eq.  (8)  and  serves  as  the  reference 
solution.  The  heavy  curves  include  the  surface 
wave  in  Eq.  (19)  (U(t,)  = 1 for  this  case),  while 
the  light  curves  have  been  obtained  from  Eq.  (19) 
with  the  surface  wave  omitted. 


Curves  for  a high  loss  case  are  depicted  in  Fig.  8,  based  on  the  ray  plus  canonical 

integral  representation.  Having  in  mind  an  application  to  ground  wave  propagation, 

the  high  loss  impedance  Z'  = 0. 22  exp(-i30.  5°)  has  been  obtained  by  assuming  a wet 

-2  - 1 

ground  surface  with  conductivity  oe=  10  mho  m and  dielectric  constant  ee=  10  eQ, 
subject  to  a wave  frequency  f = 10  MHz.  Also  shown  in  Fig.  8 is  the  near  field  form 
for  small  Y.  It  should  be  noted  that  Gj  in  Eq.  (19)  does  not  contribute  since  no  pole 
singularity  is  located  between  the  contours  C and  C in  Fig.  1 (see  also  Section  B-l). 

All  of  the  modes  here  are  fast  waves  as  seen  from  Fig.  2.  Because  of  the  dissipative 
attenuation  of  the  modes  bound  close  to  the  surface,  the  canonical  integral  contribution 
is  found  to  be  negligible  for  y>  4,  thereby  establishing  the  adequacy  of  the  ray-optical 
contribution  alone  for  describing  the  field  sufficiently  far  from  the  source  point. 
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Fig.  7.  Ray  plus  W,  Cl.  mode  plus  slow  wave  represen- 
tation in  Eq.  (27),  and  near  field  form  in  Eq.  (30) 

(--),  with  Z'=  0.22  exp(-i90°).  The  reference 
solution  (solid  curve)  is  calculated  from  Eq.  (8). 

The  numbers  indicated  along  the  curves  should  be 
read  as  follows:  for  example,  M = 2 [N=  1 ] denotes 
the  range  wherein  the  direct  ray  and  singly  reflected 
rays  plus  2 modes  (1  W.G.  mode  and  1 surface  wave 
mode)  are  applicable,  while  the  circles  indicate  the 
starting  point  of  the  relevant  intervals.  The 
coordinate  is  not  shown  since  it  has  been  depicted  in 
Fig.  6. 

The  solid  curves  in  Figs.  6,  7 and  8 are  obtained  from  Eq.  (8)  with  32  (ka  = 
100).  This  form  of  the  solution  is  taken  as  a reference  for  all  Y.  It  was  noted  in  the 
study  of  the  perfect  conductor  case1  that  the  numerical  values  derived  from  the  W.  G. 
mode  plus  continuous  spectrum  representation  are  extremely  sensitive  to  the  exact 
number  of  modes  required.  Unless  all  32  modes  (for  ka  = 100)  are  included,  the 
perfect  conductor  was  found  to  deviate  appreciably  from  the  correct  shape.  The  same 
remark  applies  to  the  finite  surface  impedance  case  in  Figs.  6,  7 and  8 since  the 
higher  order  modes  have  small  attenuation  coefficients  (see  Fig.  2)  even  though  the 
losses  may  be  large.  Because  of  this  feature,  it  is  preferable  to  use  one  of  the  other 
representations  for  field  calculation. 


Fig.  8.  Ray  plus  canonical  integral  representation  in 
Eq.  (19),  and  near  field  form  (--)  in  Eq.  (30), 
with  Z'  = 0. 22  exp(-i30.  5°).  Note  that  U(tj)  = 0 
in  this  example  (see  Fig.  2).  The  reference 
solution  (solid  curve)  is  calculated  from  Eq.  (8). 

— • — • — N = -1; N = 0;  . . N=  1; 

and  — . . . N = 2. 


In  Fig.  9,  we  have  shown  how  increasing  the  amplitude  of  the  normalized  impedance 
Z'  affects  the  surface  field;  here  arg  Z'=  -45°.  Curves  are  depicted  based  on  the  ray 
plus  canonical  integral  representation,  and  on  the  near  field  form  for  small  V.  In  our 
computations,  |Z'|  = 0.05  has  been  chosen  as  a sample  of  a small  impedance,  and 
I Z | = 0.22  as  a sample  of  a large  impedance.  When  arg  Z'=  -90°,  corresponding  to 
a purely  inductive  impedance  boundary  which  supports  a slow  wave,  the  field  amplitude 
is  found  to  increase  as  | Z 1 1 increases  while  the  converse  is  true  when  the  boundary 
impedance  is  purely  capacitive  (see  Reference  7).  This  behavior  may  be  attributed  to 
the  greater  and  lesser  confinement,  respectively,  of  the  lowest  order  modal  field  in 
the  two  cases.  As  the  imaginary  part  of  roots  increases  (see  Fig.  5),  the  field  trans- 
mitted to  an  observation  point  tends  to  decrease  (Figure  9).  Because  of  the  absorption 
on  the  surface,  the  interference  effects  between  rays  are  also  weakened,  as  seen  from 
tne  decreasing  magnitude  of  oscillation  in  the  curves  in  Figure  9. 
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Fig.  9.  Influence  of  the  amplitude  of  the  surface  impedance 
Z1,  calculated  from  Eqs.  (19)  and  (30)  (--),  for  arg 
Z1  = -45°.  The  heavy  curves  represent  a large 
impedance  ( | Z 1 1 = 0.  22)  and  the  light  curves  a 
small  impedance  (Z’|  = 0.05).  Here,  U(t,)  = 0. 

— • — • — N=  -1; N=  0; — N=  1;  and 

— N = 2. 

The  curves  in  Fig.  10  show  the  effect  of  the  phase  of  Z1  on  the  field  when  the 
amplitude  remains  constant  at  l Z * l —0.05.  One  observes  that  the  field  tends  to  in- 
crease as  the  phase  of  Z1  deviates  from  zero;  this  could  have  been  predicted  from  the 


A A 


“i- 


/ \ ■■  \ !\  • \ ! a a i i 

\ / \ ! \ a !\l  ^ hj 

'Ai/rtMMiii! 


Fig.  10.  Influence  of  the  phase  of  the  surface  impedance 
Z1,  calculated  from  Eq.  (19),  with  |Z'|  = 0.05, 

g Z'  = 
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behavior  of  poles  in  Figure  5.  Note  that  the  maxima  and  minima  of  the  field  shift 
backward  for  negative  phase  (i.e.,  -90°  < arg  Z'<  0°),  and  they  are  found^  to  shift 
forward  for  positive  phase  (i.e.,  0°  < arg  Z'  <90°). 

D.  Conclusions 

A variety  of  alternative  representations  have  been  explored  in  this  study  of  the 
surface  fields  excited  by  a line  source  located  on  a concave  cylindrical  impedance 
boundary.  As  for  the  previously  investigated  perfectly  conducting  case,  * these  different 
formulations  provide  new  insights  into  the  propagation  mechanisms  that  prevail  for 
various  parameter  ranges  involving  the  boundary  shape,  separation  of  source  and 
observation  points,  and  the  surface  impedance.  From  the  detailed  discussion  and 
interpretation  of  the  numerical  results  in  Section  C emerges  the  role  played  by  the 
geometric  optical  field  and,  for  sufficiently  inductive  boundaries,  by  the  surface  wave. 
The  presence  of  the  surface  wave  leads  to  an  enhancement  of  the  field  over  that  ob- 
served on  a perfect  conductor.  When  losses  are  appreciable,  attenuation  of  mode 
fields  bound  close  to  the  boundary  establishes  the  geometric  optical  field  as  the 
dominant  and  adequate  constituent.  The  versatility  of  representations  involving  the 
canonical  integral  has  again  been  confirmed,  and  tabulations  have  been  provided 
for  several  values  of  surface  impedance. 

Army  Research  Office 

DAHC -04-75-G -01 52  T.  Ishihara  and  L.  B,  Felsen 

REFERENCES 

1.  T.  Ishihara,  L.  B.  Felsen,  and  A.  Green,  "High  Frequency  Fields  Excited  by 
a Line  Source  Located  on  a Perfectly  Conducting  Concave  Cylindrical  Surface,  " 
to  be  published  in  IEEE  Transactions  on  Antennas  and  Propagation. 

2.  L.  B.  Felsen  and  N.  Marcuvitz,  "Radiation  and  Scattering  of  Waves,  " Englewood 
Cliffs,  N.  J.  : Prentice  Hall,  Inc.,  Secs.  2.3,  3.4,  (1973). 

3.  V.  M,  Babich  anc  V.  S.  Buldyrev,  "Asymptotic  Methods  of  Short  Wave  Diffraction,  " 
Moscow,  USSR:  Nauka,  Chapter  1 1 , Section  4 (1972). 

4.  W.  Wasylkiwskyj,  "Exact  and  Qausi-Optic  Diffraction  within  a Concave  Cylinder,  " 
IEEE  Transactions  on  Antennas  and  Propagation,  Vol.  AP-23,  No.  4,  pp.  480-492 
(1975). 

5.  T.  Ishihara,  "Tabulation  of  a Canonical  Integral  Occurring  in  the  Theory  of  Wave 
Propagation  along  a Concave  Cylindrical  Impedance  Surface,  " submitted  for 
publication. 

6.  V,  M.  Babich  and  V,  S.  Buldyrev,  "Asymptotic  Methods  of  Short  Wave  Diffraction,  " 
Moscow,  USSR:  Nauka,  Appendix  4 (1972). 

7.  T.  Ishihara,  "High  Frequency  Behavior  of  Source  Excited  Concave  Surfaces,  " 

Ph.  D.  Dissertation  (Electrophysics),  Polytech.  Inst,  of  New  York,  (June  1978). 

8.  G.  Hasserjian  and  A.  Ishimaru,  "Currents  Induced  on  the  Surface  of  a Conducting 
Circular  Cylinder  by  Slot,  " J.  Res.  Nat.  Bur.  Stand.,  Vol.  66D,  No.  3,  pp.  335- 
365  (1972). 


ELECTROMAGNETICS 


17 


HIGH-FREQUENCY  SURFACE  FIELDS  EXCITED  BY  A POINT  SOURCE  ON  A 
CONCAVE  PERFECTLY  CONDUCTING  CYLINDRICAL  BOUNDARY 

L.  B.  Felsen  and  T.  Isihara 
A.  Introduction  and  Summary 

As  noted  previously,  when  a h gh-frequency  radiator  is  located  on  or  near  a con- 
cave surface,  the  fields  on  or  near  the  surface  cannot  be  calculated  by  conventional 
geometrical  optics,  although  the  observation  point  is  visible  from  the  source.  This 
circumstance  has  led  to  a detailed  study  of  alternative  formulations  for  evaluation  of 
the  surface  field.  The  interior  of  a circular  cylinder  has  served  as  a prototype  because 
of  the  availability  of  rigorous  field  solutions  for  this  configuration.  A detailed  study  of 
the  two-dimensional  axially  independent  problem*’  has  revealed  the  utility  of  field 
representations  that  incorporate  in  various  combinations  geometric  optical  contributions, 
whispering  gallery  modes,  continuous  spectra,  and  canonical  wave  species  analogous 
to  those  expressed  by  the  Fock  integral  for  convex  shapes.  This  previous  investigation 
has  provided  not  only  a quantitative  means  for  determining  the  field  but  also  new  insight 
into  the  propagation  characteristics. 

In  the  present  study,  the  analysis  is  generalized  in  Section  B to  three  dimensions 
by  considering  fields  excited  by  an  axial  magnetic  dipole  source  instead  of  a line  source. 
The  perfectly  conducting  circular  cylindrical  configuration  is  retained  for  the  reasons 
stated  above.  Because  of  a simple  Fourier  integral  relation  that  connects  two- 
dimensional  and  three-dimensional  Green's  functions  in  axially  uniform  cylindrical 
regions,  the  previous  results*  can  be  utilized  directly  in  the  analysis.  Therefore,  the 
presentation  here  is  limited  to  providing  only  the  major  steps  and  the  results  since  de- 
tails may  be  obtained  from  Reference  1.  Emphasis  is  placed  on  the  derivation  of  tractable, 
numerically  accurate  field  expressions  (Section  C)  that  incorporate  "invariant"  para- 
meters. This  feature  furnishes  not  only  a physical  interpretation  of  the  propagation 
mechanism  but  also  facilitates  subsequent  generalization  to  non-circular  shapes.  The 
utility  and  accuracy  of  these  expressions  is  assessed  in  Section  D by  extensive  numeri- 
cal calculations  which  also  highlight  limitations  that  inhibit  easy  extension  of  .certain 
two-dimensional  formulations  to  three  dimensions. 

A limitation  of  the  results  presented  here  is  the  inability  to  accommodate  observa- 
tion points  displaced  axially  or  nearly  axially  from  the  source.  As  for  the  convex  surface, 
the  field  behavior  in  the  paraxial  region  cannot  be  inferred  from  generalization  of  axially 
independent  solutions. 
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B.  Analytical  Formulation 

The  analytical  formulation  of  the  three-dimensional  problem  from  a knowledge  of 
the  two-dimensional  one  is  straightforward.  The  time -harmonic  three-dimensional 
Green's  function  G(r , r',  k),  where  r = ( £ , z)  is  the  observation  point,  r ' = ( £,  z')  is  the 
source  point,  and  k is  the  free-space  wavenumber,  can  be  obtained  from  the  two- 
dimensional  z independent  Green's  function  G(£,  £/,  k)  by  the  following  operation:^ 

. x _ ik  |z-z,j 

G(£.  £',  k ) = ± J G(£,  £ ',  kt)e  z dkz  (1) 

- 00 


where 


kt  = ■ kz  ’ Im  kt>°  (la) 

The  dependence  on  k in  G will  henceforth  be  omitted  in  the  notation.  A time  dependence 
exp(-iuit)  is  implied.  The  Green's  function  G is  assumed  to  have  a vanishing  normal 
derivative  on  the  cylindrical  boundary,  and  thus  represents  a scalar  potential  from 
which  the  electromagnetic  field  of  an  axially  oriented  magnetic  dipole  element  can  be 
derived. 

When  the  transformation  in  Eq.  (1)  is  applied  to  asymptotic  or  other  approximate 
forms  of  G,  care  must  be  taken  to  ensure  that  the  replacement  in  G or  k by  kt>  with 
- 00  < kz<  <»,  does  not  invalidate  assumptions  made  in  the  approximate  evaluation  of  5. 

In  Ref.  1,  it  was  assumed  that  ka  » 1,  where  a is  the  radius  of  curvature  of  the  cylin- 
drical surface.  Thus,  the  integration  path  in  Eq.  (1)  must  be  deformed  away  from  the 
branch  point  at  kz  = k.  Moreover,  the  major  contribution  to  the  integral  in  Eq.  (1) 
must  come  from  kz  values  that  retain  | k^a  | large.  Since  k^  represents  the  wavenumber 
transverse  to  z,  this  implies  that  all  relevant  wave  constituents  must  have  a large 
transverse  component,  i.e.,  observation  points  displaced  axially  or  almost  axially 
from  the  source  point  are  excluded.  Furthermore,  if  complex  values  of  k^  should  be- 
come relevant,  the  appropriate  analytic  continuation  from  k to  kt  must  be  performed. 
Subject  to  these  considerations,  most  of  the  two-dimensional  results  can  be  readily 
transformed  via  Eq.  (1)  into  the  three-dimensional  domain. 

A variety  of  different  field  representations,  each  with  its  own  physical  interpre- 
tation and  utility,  can  be  extracted  from  Eq.  (1).  They  are  obtained  from  the  corres- 
ponding representations  in  Ref.  1 upon  eliminating  the  kz  - integral,  when  possible,  by 
saddle  point  evaluation  with  the  aid  of  the  transformation  from  the  kz  - plane  to  the 

0-plane  via  k = k sin  0,  with  k = kcos  6.  Any  two-dimensional  field  constituent  that  is 
z t 
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of  the  form 


G = B(£,  g',  k)  exp(ikD)  (2) 

where  B is  a slowly  varying  amplitude  and  D is  a length  parameter,  gives  rise  to  a 
saddle  point  kzg  in  Eq.  (1)  located  at 

k*s=ksin0S'  es  = tan_1  <2a> 


with  ktg  = kcos  0g.  In  the  0-plane,  the  saddle  point  is  located  at  6g.  Because  of  the 
restrictions  noted  earlier,  the  range  0 ->  rr/2  must  be  excluded.  When  the  integral  in 
the  0-plane  in  then  evaluated  by  the  conventional  saddle  point  formula,  the  resulting 
three-dimensional  field  has  the  form: 


G< 


/ kcos*”© 

£,  r')  s 


^ » ikL-iTT/4 
B(g,  g , kcos0s)e  ' 


(3) 


where 


L = z-z  sin  0 + Dcos  0 
1 s s 

C.  Alternative  Field  Representations 
1 . Basic  Representation 

The  basic  integral  representation  is  given  by  Eq.  (1)  with1 

_ , 1 /•  «^pCi  v l4> -♦'I  ] 

G(g,  £ , k ) = J — --  dv 

i(rrkta)Z  c J^(kta) 


(3a) 


(4) 


where  the  contour  ^ and  the  singularities  of  the  integrand  in  the  complex  v-plane  are 
shown  in  Fig.  1,  and  the  prime  on  the  Bessel  and  Hankel  functions  denotes  the  derivative 
with  respect  to  the  argument.  The  source  and  observation  points  lie  on  the  large  circu- 
lar cylindrical  boundary  of  radius  a whereon  the  normal  derivative  of  G is  assumed  to 
vanish.  Note  that  the  formulation  in  Eqs.  (1)  and  (4)  contains  the  essential  information 
for  the  fields  along  a circular  concave  segment.  All  effects  that  would  represent  be- 
havior attributable  to  a closed  cylindrical  cavity  have  been  removed.  1 In  particular, 
the  azimuthal  periodicity  has  been  eliminated  by  formulating  the  field  problem  in  an 
infinite  angular  space  - oo  < <J>  < oe.  This  formulation,  equivalent  to  placing  "perfect 
absorbers  for  angularly  propagating  waves"  along  two  radial  planes,  introduces  spurious 
scattering  from  the  origin,  which  has  also  been  extracted. 


1 


Fig.  1.  Integration  paths  and  singularities  in  complex 


V of  H 
P V 


/(2) 


(kta); 


. . zeros  v 


ofj'  (ka). 


Also  shown  are  the  steepest  descent  paths  associated 
with  various  saddle  points. 


2.  Ray-optical  Representations 

The  ray -optical  solution  can  be  written  as 


G = V g 
Li  n 

n=0 


* , ,n  e*P<lkLn> 

5n“(-l)  tTL 


where 


L = I z-z/  I sin  0 + D cos  0 

n 1 1 sn  n 


sn 


(5) 


(6) 


(7) 


represents  the  actual  path  length  of  a ray  reflected  n-times  on  the  conducting  boundary, 
and  is  the  projection  of  L.^  on  the  transverse  cross-section. 

3.  Ray  plus  Canonical  Integral  Representation 

Since  the  caustics  for  large  n approach  to  the  boundary  p = a,  the  geometrical  ray 
series  is  not  applicable  to  rays  that  have  experienced  many  reflections.  It  is,  therefore, 
necessary  to  truncate  the  number  of  rays  at  some  n = N such  that  none  of  the  required 
assumptions  are  violated.  Thus,  instead  of  the  infinite  series  expansion,  one  may 
obtain 

N 

G = Z Sn  + RN  (8) 

n=0 


i 


A 


(8) 
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where 


N+2  ikR  + in/4  cos0  2/3 


2tt  a ^ZrrkR 


°°eiTT'3  fWj(t)  ->N+ 1 

'N<B  A'(t) W2'(t)  U'(t)  ^ 


= (^)1/5k.vi  = (^)l/3i 


R=  Iz-z'IsinO  + s cos  0 (II) 

1 1 s 8 

The  number  of  rays  (N+l)  and  the  arc  length  parameter  g are  related  by  the  criterion 


2(2  ' 3 A) 


1 < (N+l)  < 


2(2  ' JA) 


, A = A(ka)  = 0(1) 


The  geometrical  quantities  R,  the  distance  between  Q and  P along  the  geodesic  on  the 
cylinder  surface,  and  0g,  the  geodesic  departure  angle,  are  shown  in  Fig.  2.  Except 
for  the  replacement  of  Y by  0,  Lj^(g)  is  the  same  as  I^Y)  and  has  been  tabulated  else- 
where^’ ^ for  selected  ranges  of  0 and  N. 

4.  Ray  plus  Whispering  Gallery  (W.G.)  Mode  Representation 


It  may  be  shown  that  the  remainder  integral  in  Eq.  (9)  can  be  simplified  so  that 
one  obtains  instead  of  Equation  (8): 

G = Z Cn  + Z Grn  • I CN  (l3) 

n=0  m=l 

The  m-th  three-dimensional  W.  G.  mode  Gm  is  given  by 


ikR+i? 


G - 
m 


4 ,kacos8  v2/3  i^°m 


f 2tt  a kR 


where  c are  determined  by 
m 


Ai  (-o  ) = 0 , m = 1,  2,  . . . , M 


. - «•  •'  '• 
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Fig.  2.  Geometrical  quantities  pertaining  to  whispering  gallery  mode. 

R = R(0s)  is  the  geodesic  along  the  surface  between  the  source 
and  observation  points. 


The  number  of  rays  (N+l)  is  determined  by  the  inequality  Equation  (12).  Because 
of  Eq.  (12),  larger  values  of  g are  required  for  inclusion  of  higher  order  reflected  rays. 
For  given  N,  as  g increases  in  its  applicable  range,  a new  W.G.  mode  must  be  included 
v/henever  the  N-times  reflected  ray  touches  the  caustic  of  that  mode.  The  process  for 
proper  selection  of  N and  M is  schematized  in  Fig.  3,  where  each  concentric  cylinder 
depicts  the  modal  caustic  of  a three  dimensional  whispering  gallery  mode.  For  example, 
within  the  ranges  of  P.  , P?  and  P , one  may  choose  N=  0 (M=  1),  N = 0 (M  = 3)  and 
N = 1 (M=l),  respectively.  It  should  be  noted  that  in  the  simpler  two-dimensional  case, 
we  were  able  to  describe  the  field  by  a single  (direct)  ray  plus  W.  G.  modes  for 
Y>  2(2*/^A)^^  « 3.  1 since  it  was  possible  to  adequately  represent  the  required 
number  of  modes,  even  those  of  higher  order.  However,  in  the  three-dimensional 
case,  use  of  the  Fock  asymptotic  approximations  restricts  the  applicability  of  the 
formulas  to  the  lower  order  modes.  This  aspect  is  discussed  further  in  Section  D. 
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None  of  the  above  representations  applies  in  the  limiting  case  (3  -»  0 in  Eq.  (10), 


which  arises  either  when  s and  |z-z'|-*0  with  a and  0 fixed,  or  when  a -»  oo  with  s and 

Is 

0 fixed.  For  this  case,  one  may  show  that 


1 1rR  i 

G * fnir  ^ + E bj  e(3j/2,  + °(e3V2)} 

j=i 


6.  Whispering  Gallery  Mode  plus  Continuous  Spectrum  Reoresentation 


In  the  two-dimensional  case,  it  was  feasible  to  derive  a field  representation  in 


terms  of  the  totality  of  W.  G.  modes  plus  a continuous  spectrum  that  accounts  primarily 
for  near  field  effects.  * This  formulation  served  as  an  accurate,  though  not  necessarily 


convenient,  reference  solution  since  it  was  possible  to  render  both  of  these  fields 
constituents  numerically  tractable.  However,  the  additional  integration  in  Eq.  (1)  re- 
quired for  the  three-dimensional  case  introduces  complications  that  make  the  numerical 
treatment  impractical  and  inaccurate.  For  this  reason,  we  have  omitted  this  repre- 
sentation from  the  present  study. 


D.  Numerical  Results 


Numerical  computations  have  been  performed  for  a circular  cylindrical  boundary 
with  ka=  100,  50,  20  and  initial  geodesic  departure  angles  0=  0°,  30°,  60°  and  80° 

(ka=  100).  Because  of  the  restriction  that  the  main  contributions  to  the  basic  integral 
in  Eq.  (1)  must  arise  from  k^  values  that  retain  |kta|  large,  we  have  chosen  the 


largest  geodesic  departure  angle  as  0 = 80°  for  ka  = 100  and  as  0 = 60°  for  ka  = 50  and 

s s 


■ 
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20.  This  validates  use  of  the  various  asymptotic  approximations  for  G.  In  each  figure, 
the  field  magnitudes  |g|  for  different  parameters  6g  are  plotted  versus  0 in  Eq.  (10), 
which  is  the  universal  parameter  incorporating  a,  |<t>-<))/|  and  0g.  Corresponding  |4>-(j/| 
coordinates  for  different  0g  are  depicted  in  Fig.  4(a)-(c).  Note  that,  for  given  0,  the 
geodesic  distance  R in  Eq.  (11)  and  the  angular  distance  |<)>-4/|  increase  as  cos  6g  or 
ka  (.1  fixed)  decreases  (see  also  Figure  2).  For  example,  at  0=  1,  one  has  R»2.7 
(6=0°),  R « 3 . 2 (0  =30°),  R » 6.  8 (6  =60°),  R » 28  (0  = 80°)  for  ka  = 100  (a  = 10  meter), 
and  R«<4.6(0g=O  ),  R«5.6(0g=3O°).  R»11.7(0g=6O°),  forka=20(a=10meter),  where 
R is  in  meters.  Since,  for  fixed  ka  and  fixed  0,  there  is  the  simple  relation  lajGfa^l  = 
|a2G(a2)  | , where  G(aj)  and  G(a2)  are  calculated  from  Eqs.  (8)  or  (13)  with  a = a^  and 
a.  = a^,  respectively,  one  need  evaluate  the  field  for  only  one  combination  of  k and  a. 

Then  use  of  the  relation  above  yields  the  field  for  arbitrary  radius  provided  that  (ka) 
is  unchanged.  First  we  shall  discuss  the  accuracy  and  range  of  validity  of  the  various 
formulations  in  Section  C. 


In  Fig.  4,  we  have  obtained  the  field  magnitudes  for  various  ka  and  0g  based  on 

the  ray  plus  canonical  integral  representation  in  Eq.  (8)  which  serves  as  an  accurate 

and  convenient  reference  solution.  When  ka  = 20  and  0 =60°,  then  k^a  = ka  cos  0 =10, 

s t s 

so  that  one  may  ask  whether  the  requirement  of  large  ka  cos  8 is  satisfied  here.  From 

1 s 

experience  with  the  two-dimensional  case,  confidence  is  established  that  the  asymptotic 
formulations  in  Section  C are  applicable  for  these  parameter  values  and  hence  for  the 
others  listed  above.  The  number  of  rays  to  be  included  in  Eq.  (8)  is  determined  by  the 
inequality  Eq.  (12),  while  the  canonical  integral  1^(0)  in  Eq,  (9b)  is  evaluated  numeri- 
cally. A tabulation  of  IM(0)  may  be  found  in  Reference  5.  As  in  the  two-dimensional 
1 w 

case,  the  choice  of  A in  Eq.  (12)  is  not  very  critical;  we  have  used  A«2  throughout. 
Because  of  the  overlap  of  the  curves  for  various  N (see  Fig.  4l,  switching  from  one 
formulation  to  the  other  is  performed  smoothly. 


The  mix  of  (N+l)  rays  and  M whispering  gallery  modes  in  Eq.  (13)  provides  the 

most  intriguing  and  physically  appealing  formulation.  As  noted  in  Section  C-4,  the 

whispering  gallery  mode  representation  Eq.  (14)  is  restricted  to  the  lower  order  W.  G. 

modes  since  we  have  used  the  Fock  asymptotic  approximations*  which  are  valid  only 

for  Vs*  kta  and  kta  large.  The  eigenvalues  vm  in  the  v-plane  and  t in  the  t-plane  are 

related  by  vm  = kta  + (kta/2)*^tm>  where  the  t^fm  = 1,  2,  . . . , M)  are  the  zeros  of 

Ai^t).^  Note  that  the  t^  are  negative  real,  and  that  their  magnitude  increases  with 

increasing  m,  the  order  of  the  mode.  Therefore,  the  restriction  v »k  a required 

m t 

for  applying  the  Fock  asymptotic  approximations  is  violated  at  a certain  number  (M+l). 

This  limitation  did  not  arise  in  the  two-dimensional  case,  * where  we  have  em- 
ployed the  uniform  asymptotic  approximations,  which  are  valid  for  the  entire  range  of 
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Fig.  4(a)  ka  = 100  (a  = 10  meter,  f = 477.  5 MHz). 

The  lightest  curve  is  obtained  from 
Eq.  (19)  with  eg  = 80°. 

Fig.  4.  Ray  plus  canonical  integral  representation.  Field  magnitude 
N 

|g|  | S G + RM  | in  Eq.  ( 19)  is  calculated  for  various 

n=0  N 

geodesic  departure  angles.  0S  = 0°  (heavy  curve),  0S  = 30° 
(medium  curve),  08  = 60°  (light  curve).  Both  the  6 and 
| 4> - <f>’  | coordinates  are  indicated.  Note  that  the  { <p - <f>'  | 
coordinates  are  depicted  by  different  scales  for  each  0 
Also  shown  are  the  ranges  in  (3  corresponding  to  N = - 1 

( ) (no  geometric- optical  ray);  N = 0 ( ) (1  ray); 

N = 1 ( — •— ) (2  rays);  N = 2 ( — ) (3  rays). 


ELECT  ROMAGN  ET IC  S 


27 


v and  large  ka,  instead  of  the  Fock  approximations  to  obtain  asymptotic  expressions 
for  the  W.  G.  modes.  But  to  obtain  the  three  W.  G.  modes  from  the  two-dimensional 
ones,  the  additional  integration  in  Eq.  (1)  is  required  and  can  be  carried  out  conveniently 
(see  Eq.  (3))  only  when  the  two-dimensional  field  constituent  has  the  form  in  Equation 
(2).  Such  a form  is  exhibited  by  the  two-dimensional  W.  G.  modes  obtained  with  the 
Fock  approximations  but  not  by  those  obtained  with  the  uniform  approximations  (D  in 
Eq.  (2)  is  in  general  a complicated  function  of  k).  Thus,  while  it  was  possible  (for 
V > 3.  1)  to  represent  the  two-dimensional  field  by  a single  ray  plus  an  appropriate 
number  of  W.  G.  modes,  * this  cannot  in  general  be  done  with  our  results  for  the  three- 
dimensional  field.  Note,  however,  that  when  the  observation  point  is  on  the  transverse 
cross-section  (i.e.,  0s  = O°),  Eq.  (2)  still  holds  for  the  uniform  expression  of  the  W.  G. 
modes,  thereby  making  it  possible  to  represent  the  field  by  a single  ray  plus  W.  G. 
modes  for  this  special  case. 

The  fields  computed  from  the  ray  plus  whispering  gallery  mode  representation 
in  Eq.  (13)  and  from  the  near  field  form  in  Eq.  (15)  are  compared  with  the  reference 
solutions  (solid  curves)  obtained  from  Eq.  (8)  in  Figure  5(a)-(c).  The  ray  plus  whisper- 
ing  gallery  mode  representation  is  invalid  in  the  near  field,  when  the  left-hand  side  of 
the  inequality  in  Eq.  (12)  is  negative;  here,  one  may  use  the  near  field  form.  From 
each  of  the  curves  in  Figs.  5(a)-(c),  it  is  seen  that  changeover  from  the  near  field  form 


lN:0(l){  \ 


V 


N=  O (♦) 


NxO(2) 

L x 


N«2(l) 

Si 
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f ig.  5(a)  ka  = 100  (a  = 10  meter).  The  lightest  curve  s calculated  from 
Eq.  (27)  with  08  = 80°. 

Fig.  5.  Ray  plus  whispering  gallery  mode  representation  in  Eq.  (27)  and  near  field  form 
in  Eq.  (32)  ( ).  The  reference  solution  (solid  curve)  is  calculated  from  Eq. 

( (19).  The  numbers  indicated  along  the  curves  should  be  read  as  follows;  for 

example,  N=  1(2)  denotes  the  range  wherein  the  direct  and  singly  reflected  ray 
plus  2 whispering  gallery  modes  are  applicable,  while  the  circles  indicate  the 
starting  points  of  the  relevant  intervals.  The  heavy,  medium  and  light  curves 
are  obtained  from  Eq.  (27)  with  0g  = 0°,  0g  = 30°  and  0 = 60°,  respectively. 
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N: 

N = l(l)  V 


Fig.  5b)  ka 


8 9 


Nr  0(2) 


Fig.  5(c)  ka  = 20 


to  the  direct  ray  plus  lowest  W.  G.  mode  form  can  be  performed  relatively  smoothly 
at  the  crossover.  In  the  near  field  computation,  11  terms  in  the  series  expansion  have 
been  used  since  this  has  yielded  the  smoothest  changeover.  It  is  also  seen  to  be 
possible  to  represent  the  field  accurately  by  only  the  fundamental  W.  G.  mode  plus  a 
proper  number  of  rays  determined  by  the  inequality.  These  excellent  approximations 
are  valid  if  the  projection  of  the  observation  point  on  the  transverse  cross-section  is 
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characterized  by  parameter  values  away  from  the  region  of  coalescence  of  a modal 
pole  and  a saddle  point  in  the  remainder  integral  R^^  (see  Ref.  1)  which  corresponds 
to  the  geometrical  contact  of  ray  and  modal  caustic  as  shown  in  Figure  3.  The  jumps 
in  each  of  the  curves  in  Figs.  5(a) -(c)  correspond  to  these  points  and  can  be  avoided  by 
resorting  to  a Fresnel  integral  formulation  of  the  asymptotic  field  in  this  transition 
region.  However,  as  in  the  two-dimensional  case,  we  have  found  that  continuity  can 
be  established  by  using  the  simple  result  for  coincident  pole  and  saddle  point  and  then 
resorting  to  a perturbation  expansion. 

Next,  we  discuss  the  field  behavior  as  a function  of  various  physical  parameters. 
To  explore  the  dependence  on  frequency  and  on  the  geodesic  departure  angle,  it  is  more 
convenient  to  plot  |g|  vs.  Rcurves,  where  R is  the  geodesic  distance  between  source 
and  observation  point  defined  in  Equation  (11).  Figure  6 exhibits  the  field  for 

|G| 


Fig.  6.  Frequency  dependence  of  the  field,  a = 10  meter,  0g  = 30°. 


k=2tr/\  = 2,  5,  10,  with  a_  and  0g  fixed  at  10  meter  and  30°,  respectively,  while  in 
Fig.  7,  0g  takes  on  values  of  0°,  30°,  60°,  80°,  and  ka  = 100  (a  = 10  meter)  is  fixed. 

It  is  evident  from  the  figures  that  the  near  field  decreases  with  the  frequency  and  with 
increasing  0g.  This  behavior  indicates  that,  for  an  isotropic  radiator,  the  near  field 
(R  < 10)  is  enhanced  on  the  transverse  cross-section,  where  0g  = 0°,  and  at  higher 
frequencies.  Interference  effects  between  rays  are  weakened  for  decreasing  k and  for 
increasing  0g,  as  seen  from  the  decreasing  number  of  oscillations  in  the  curves.  The 
accompanying  shifts  of  the  field  maxima  and  minima  to  the  right  indicate  that  when  ob- 
served along  the  geodesic  path,  the  ray  field  develops  more  quickly  at  high  frequencies 
and  at  small  geodesic  departure  angles  than  at  lower  frequencies  and  at  larger  geodesic 
angles . 
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Fig.  7.  Various  geodesic  departure  angles,  and  comparison  with 

perfectly  conducting  planar  boundary,  ka  = 100  (a  = 10  meter). 
Points  A indicate  the  geodesic  distance,  beyond  which  the 
first  ray  can  be  identified. 


Also  shown  in  Fig.  7 for  comparison  is  the  field  for  a perfectly  conducting  planar 
boundary.  From  the  restrictions  imposed  on  our  computations,  we  cannot  place  the 
observation  point  near  the  axis  of  the  concave  cylinder;  yet,  even  the  curve  for  0 = 80° 
(ka=  100)  is  valid  since  kacos  0g  = 17  is  sufficiently  large.  It  is  interesting  to  note  that 
in  this  latter  example,  where  the  geodesic  path  experiences  weak  curvature,  no  ray 
develops  at  all  over  the  entire  range  of  R considered  (0  < R<  56),  and  that  the  field 
resembles  the  one  on  a perfectly  conducting  planar  boundary  over  a substantial  distance. 
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PERIODIC  STRUCTURE  GTD  FOR  ANALYSIS  OF  MUTUAL  COUPLING  IN  ARRAYS 
ON  CONCAVE  SURFACES* 

H.  Ahn  and  A.  Hess  el 
A.  Introduction 

Arrays  on  concave  surfaces  form  an  integral  part  of  electronically  scanned 

1 

space-fed  (feed-through)  lenses  of  the  dome  antenna,  or  the  multibeam  bootlace  type. 

This  report  surveys  the  development  of  the  so-called  Periodic  Structure  Geo- 
metric Theory  of  Diffraction  (PSGTD)  for  evaluation  of  coupling  (scattering)  coefficients 
in  "collector"  arrays  of  aperture  elements  concave  surfaces.  The  method  of  analysis 
and  some  partial  results  for  far  mutuals  were  briefly  described  in  JSTAC  Report  No. 

42.  We  have  since  carried  out  the  development  of  the  relevant  transition  function  for 
the  evaluation  of  the  near  zone  coupling.  For  the  sake  of  clarity  and  continuity  we 
shall  review  the  entire  development. 

The  development  of  PSGTD  has  been  motivated  by  the  low  efficiency  (or  the  poor 

convergence)  in  concave  geometry  of  the  usual  method  of  evaluating  the  mutual  coupling, 

which  relies  on  numerical  calculation  of  the  mutual  admittance  coefficients,  followed 

by  inversion  of  the  admittance  matrix.  The  convergence  difficulties  arise  from  the 

very  slow  decay  of  mutual  interaction  in  concave  geometry,  as  compared  to  that  in 

2 

arrays  on  planar  or  convex  surfaces.  The  reason  for  such  a slow  decay  is  the  presence 
of  line  of  sight  between  each  of  the  array  elements  in  concave  geometry.  To  circum- 
vent this  difficulty  and  to  develop  a suitable  alternative  approach,  it  is  necessary  to 
depart  from  a canonical  problem  associated  with  a conducting  surface  and  begin  the 
analysis  with  an  appropriate  prototype  problem,  specifically,  that  of  coupling  coef- 
ficients in  an  angularly  periodic,  circularly  cylindrical,  concave  array  (Fig.  1).  In 
that  case,  the  high  frequency  asymptotic  evaluation  of  fields  or  coupling  coefficients 
gives  rise  to  a variant  of  the  geometric  theory  of  diffraction,  referred  to  as  Periodic 
Structure  GTD,  because  the  relevant  canonical  launching  reception,  and  reflection 
problems  are  associated  with  periodic  planar  arrays,  rather  than  with  conducting 
planar  surfaces,  as  in  the  standard  GTD  practice. 

The  virtues  of  the  PSGTD  in  application  to  arrays  on  concave  surfaces  are: 

a)  The  results  are  generalizable  to  slow  variation  of  curvature  and  periodicity. 

b)  The  method  employs  planar  phased  array  results  as  canonical  problems  for 
launching,  attachment  and  reflection  coefficients. 

c)  It  permits  exploitation  of  planar  phased  array  computer  programs,  and  does 
not  require  inversion  of  large  matrices. 

d)  It  furnishes  simple  physical  explanation  of  the  characteristic  features  of 
numerical  results. 

I 

i 


This  material  was  presented  at  the  1978  URSI  General  Assembly  in  Helsinki,  Finland. 
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Fig.  I.  Cylindrical  collector  array. 

As  will  be  seen  subsequently,  the  periodic  structure  rays  travel  along  straight 
lines  and  their  geometric  optical  trajectories  are  identical  with  those  occurring  when 
both  the  source  and  the  observation  point  are  located  on  the  unperforated,  concave, 
conducting  array  support  surface.  These  ray  fields  represent  local  plane  waves, 
radiated  by  the  excited  element  and  observed  in  the  receiving  element,  in  a matched 
terminated  array  environment.  Their  launching  and  reception  (attachment)  coefficients 
are  no  longer  those  on  a planar  conducting  surface,  but  correspond  to  planar  periodic 
arrays  appropriate  to  the  .respective  environments  of  the  transmitting  or  the  receiving 
element  on  the  concave  surface.  In  passing  from  the  former  to  the  latter,  the  ray 
fields  are  weighted  by  the  angle  dependent  launching  and  attachment  coefficients  which 
are  proportional  to  the  transmitting  and  receiving  element  patterns  in  infinite  planar 
arrays  with  elements  and  lattice  corresponding  to  the  respective  local  array  geometries 
on  the  concave  surface.  As  in  the  case  of  a smooth  concave  conducting  surface,  there 
appears  here  also  a direct  ray  and  a series  of  multiple  reflected  ray  contributions. 

The  reflection  coefficients  of  the  latter  are  no  longer  from  a perfectly  conducting  surface 
but  correspond  to  a plane  wave  incidence,  in  the  ray  direction,  on  a periodic  planar 
array  with  a lattice  appropriate  to  that  of  the  local  environment  surrounding  the  point 
of  reflection. 
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Similarly  to  the  GTD  formalism  on  smooth  conducting  surface,  the  periodic 
structure  ray  expressions  are  valid  in  the  "far"  zone  only;  in  the  proximity  of  the 
source  a transition  function  is  required. 


B.  The  Concave  Array  Model  and  Basic  Steps  in  Development  of  PSGTD 


The  canonical  collector  array  model  shown  in  Fig.  1 is  a uniformly  spaced  array 
of  N open-ended  "single  mode"  parallel-plate -guides  in  a concave,  circularly  cylindrical 
conducting  surface.  The  waveguides  propagate  only  the  dominant  TEM  mode  and  are 
equipped  with  a network  that  would  broadside  match  an  infinite  planar  phased  array 
(termed  the  reference  array)  with  the  same  elements  and  lattice  as  the  cylindrical 
array.  The  integral  representation  for  the  coupling  coefficients  is 


S 
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N/v  -j2j!V{f+qN) 

f T(v)  e w dv, 

-N/v 


q = 0 

q t 0 


source  term 

(1) 

image  terms. 
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In  Eqs.  (1)  and  (2)  I"(v)  is  the  active  reflection  coefficient  corresponding  to  a Floquet 

index  v in  the  wedge-shaped  unit  cell  of  Fig.  1,  and  Y(v)  is  the  corresponding  active 

aperture  admittance.  Y (o)  = G (o)+jB  (o)  is  the  active  broadside-scan  admittance 

P P P 

of  the  unmatched  reference  array. 

Steps  in  the  asymptotic  evaluation  of  s’j  : 

1.  Debye  approximation  lor  J i i(kro)  jj  v ((kr^). 

' m ' ’ m ' 

2.  Expansion  of  T(v)  into  Neumann  series. 

3.  Stationary  phase  evaluation  term-by-term  of  the  series  yields  periodic 
structure  ray  contributions  to  "far"  zone  mutuals, 

4.  For  near  neighbors:  Airy  function  approximation  for  J and  contour  deforma- 
tion yield  the  transition  function. 
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C.  Periodic  Structure  Ray  Fields 

The  stationary  phase  (or  the  periodic  structure  ray)  expressions  for  the  coupling 
coefficients  S^  are  shown  in  Fig.  (2)  along  with  a number  of  t/pical  rays.  One  notes 
that  Sj  is  a superposition  of  various  ray  contributions  S^n.  Numerical  results  show 
that  only  a small  number  of  rays  is  required  for  a sufficient  accuracy.  On  the  basis 
of  the  phase  term  one  concludes  that  the  ray  paths  are  straight  lines,  identical  to  those 
for  a magnetic  line  source  located  in  the  center  of  the  aperture  of  the  excited  element, 
for  observation  point  in  the  center  of  the  aperture  of  the  receiving  element.  Each  ray 
is  described  by  three  indices.  The  subscript  f denotes  the  receiving  element,  the 
superscript  q^O,  indicates  the  image  source  from  which  the  ray  originates  (such  rays 
sweep  through  an  angle  larger  than  180°,  subtended  at  the  center  before  reaching  the 

EXCITEO  ELEMENT 
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Fig.  2.  Periodic  structure  ray. 

receiving  element  (positive  (or  negative)  values  of  q correspond  to  rays  traveling  clock- 
wise (or  counterclockwise)).  The  index  n denotes  the  number  of  reflections  of  the  ray 
during  its  passage  from  the  transmitting  to  the  receiving  element.  Figure  2 shows  the 
direct,  singly  and  doubly  reflected  q = 0 rays  and  tne  singly  and  doubly  reflected  q=  -1 
rays.  One  notices  that,  in  view  of  the  circular  cylindrical  geometry,  the  ray  exit  angle 
0^n  at  the  transmitting  element  equals  to  the  incidence  angle  of  this  ray,  both  with 
respect  to  the  appropriate  local  normals.  When  the  receiving  element  approaches  the 
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source  6^n  approaches  90°  and  the  ray  path  becomes  too  short  for  the  ray  concept  to 

be  valid.  In  formal  terms,  the  Debye  approximation  of  J breaks  down  at  the  stationary 

2-2  v 

point.  The  appearance  of  the  factor  Tp  and  is  the  basic  feature  of  PSGTD  rays  and 

is  understood  as  follows.  The  factor  T (sin0.9n)  is  the  transmission  coefficient  of  an 

P * 

infinite  planar  phased  array  (the  reference  array)  located  tangentially  at  the  excited 
element  site  (Fig.  2)  and  scanned  to  an  angle  0^n  off  broadside . The  elements  and 
lattice  of  this  array  are  appropriate  to  the  local  geometry  of  the  cylindrical  array  in 
the  vicinity  of  the  excited  element.  From  phased  array  theory  Tp(sin0j)  is  also  the 
pattern  in  the  ray  direction  of  the  transmitting  element  in  the  above  match-terminated 
array  environment.  The  appearance  of  Tp(sin0^n)  corresponds  to  the  product  of  the 
launching  and  attachment  coefficient.  The  latter  is  recovered  from  the  reception 
problem  of  a plane  wave  incident  in  the  ray  direction  on  a planar  array  tangential  to 
the  cylinder  at  the  receiving  element  location  (Figure  2).  In  view  of  circular  cylindri- 
cal geometry  and  reciprocity  the  attachment  coefficient  is  also  proportional  to  T (sind’J”). 

a q n P * 

The  factor  T (sin  9^  ) is  the  plane  wave  reflection  coefficient  incident  in  the  ray  direc- 

tion  on  an  infinite  planar  array  tangential  to  the  cylinder  and  with  a local  lattice  at  the 
point  of  reflection.  Finally,  the  factor  eJnTT/2  indicates  that  an  n times  reflected  ray 
is  n times  tangent  to  a caustic. 

Numerical  Results 

Figure  3 shows  a comparison  between  the  exact  modal  solution  for  magnitude  of 
coupling  coefficients  and  the  PS  ray  results  for  the  array  of  Fig.  1 with  kr  = 84.6 
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and  with  the  total  number  of  180  elements  spaced  0.47X  apart.  The  exact  solution 
indicates  a |S^  | curve  falloff  which  follows  initially  that  in  a planar  reference  array. 
Gradually,  however,  the  decay  rate  of  | | versus  the  element  number  decreases  and 

the  curve  passes  through  a dip  with  a subsequent  rise,  accompanied  later  by  a distinct 
ripple. 

All  ray  contributions  belong  to  the  q = 0 family.  The  direct  ray  (Fig.  3)  contri- 
bution (dashed)  sets  the  average  level  of  the  curve  past  the  depression  and  departs 
from  the  exact  solution  near  the  source  for  f < 14.  Superposition  of  the  direct  and  the 
singly  reflected  ray  yields  the  rimmed  circles  and  demonstrates  that  the  characteristic 
ripple  is  caused  predominantly  by  the  inte reference  between  the  direct  and  the  singly 
reflected  ray  fields. 

The  rise  of  the  curve  past  the  dip  region  can  be  simply  explained  in  terms  of  the 
direct  ray  result.  By  the  phased  array  theory  and  geometry 


as  t increases,  0°°  decreases  and  v cos  0°°increases . Also,  as  0 °°  decreases  the 
planar  active  array  reflection  coefficient  likewise  decreases,  since  the  elements  are 
broadside  matched.  As  a result  | | increases  as  the  receiving  element  approaches 

the  diametrically  opposite  position.  Figure  4 shows  the  improvement  in  the  ray  result 

! Sjl  dB 


38 


ELECTROMAGNETICS 


I 


when  five  most  significant  rays,  q=0,  n = 0,  1 , 2 and  q = - 1 , n=l,2,  are  incorporated. 
Figure  5 indicates  a similar  agreement  for  an  array  with  about  half  the  radius  (kr  - 42). 
Again,  the  correlation  is  excellent  beyond  the  center  of  the  dip.  The  phase  of  calcu- 
lated by  the  ray  approach  tracks  that  of  modal  analysis  to  within  a few  degrees. 


|Sfl|dB 
0 
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Fig.  5.  Comparison  of  modal  and  ray  analysis. 
D.  Near  Zone  Transition  Function 
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When  the  receiving  element  approaches  the  source  all  image  terms  q / 0 in  Eq,  ( 1) 
will  be  represented  by  ray  contributions  and  only  the  q = 0 term  requires  a transition 
function,  since  the  q = 0 integral  in  Eq.  (1)  on  the  real  axis  cannot  be,  as  is,  generalized 
to  a variable  curvature.  The  development  of  the  transition  function  is  similar  to  that 
employed  for  a cylindrical  cavity  with  surface  impedance  wall  in  Ref.  (3),  which  in  turn 
follows  an  ideal  of  Ref.  (4)  for  the  case  of  a perfectly  conducting  cavity. 


Since  the  integrand  in  Eq.  (1)  is  oscillatory  on  the  real  v axis,  it  is  essential  to 
deform  the  contour  in  such  a way  as  to  localize  the  major  contribution  to  a vicinity  of 
V=  krQ.  The  result  is  the  following  transition  function 
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a(krQ)  = 2 
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( ^ ' Y*(0)  I Y;(kro)  j 


Lob  /sin  kb/2  \2  1 /kroN 

‘ ° = ’V-0  J t-wf)  v*(0,  * Y^r  ) ' M 

P p o' 


i r . fkb  . mX.!"]2 

v/,.  co  b v slnL^~(1  + ~)J  i 

Yp  kr°  * d u.  / . mX  ^ kb  , r— t ^ 

° WoL  l1  + — y-T 

where  the  integration  variable  t arises  from  the  usual  transformation 
,kr  . 1/3 

V = kroMT£)  4 (8) 

The  details  of  the  contour  are  given  in  Figure  6.  Figures  7(a)  and  7(b)  show  a compari- 
son between  the  exact  modal  result  (black  circles),  the  transition  function  for  cylinder 
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Fig.  6.  Contour  for  near  zone  transition  function. 
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radii  krQ  = 84.6  and  krQ  = 42.3  and  the  planar  result.  It  is  seen  that  the  agreement  is 
quite  good,  but  for  the  first  neighbor  ( t = 1)  one  may  wish  to  use  the  planar  result  for 
better  accuracy.  On  the  other  hand,  there  is  a fair  amount  of  overlap  in  the  dip  region 
between  the  ray  results  of  Figs.  4 or  5 and  the  transition  function  of  Figures  7(a)  and 
7(b). 


ISjildB  lS*|dB 


Fig.  7.  Near  field  transition  function. 

a)  krQ  = 84.  6,  b)  krQ  = 42.  3. 

In  order  to  present  the  behavior  of  the  integrand  in  the  neighborhood  of  t = 0 and 
particularly  along  the  contour  of  integration,  one  casts  the  entire  integrand  in  Eq.  (4) 
into  an  exponential  form  with  w(t)  = u(t)  + jv(t).  The  plot  of  the  surface  u(t)  in 

the  neighborhood  of  t = 0 (v  = krQ)  is  shown  in  Fig.  8 along  with  the  footprint  of  the 
contour  of  integration.  Two  poles  due  to  Ai7+  jzAi  = 0 are  shown  on  the  left  and  the 
first  pole  due  to  w'  + jzwj  = 0 is  shown  on  the  right.  It  is  seen  that  values  of  u(t), 
where  e11^  corresponds  to  the  amplitude  of  the  integrand,  decay  on  either  side  of  t=0, 
indicating  a localization  of  the  contributions  to  the  vicinity  of  t=  0. 
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LIMITATIONS  ON  GAIN  VERSUS  SCAN  FOR  A DOME  ANTENNA 
H.  Steyskal,  A.  Hessel  and  J.  Shmoys 

This  study  considers  limitations  on  the  gain  and  its  scan  variation  in  a 2- 
dimensional  dome  antenna,  as  shown  in  Figure  1.  An  upper  bound  on  the  average 


Fig.  1.  Two-dimensional  dome  antenna.  L=  feed  array 
length,  a = dome  radius,  <p=scan  direction,  a = 
local  scan  angle  of  feed  array,  = angular  dome 
coordinate. 

gain  over  the  scan  sector  is  derived,  which  is  lower  than  that  given  in  Ref.  2 and  an 
upper  bound  on  the  gain  in  any  particular  scan  direction  is  found.  Illustrative  examples 
of  gain-vs-scan  variation  are  given. 

The  analysis  employs  the  following  assumptions  about  the  antenna  system:  1) 
circular  cylindrical,  thin  dome  with  a fixed  insertion  phase  distribution  \[t  and  with  a 
feed  array  located  on  the  diameter;  2)  scan  coverage  sector  -90  < cp  < 90°;  3)  symmetry 
about  zenith;  4)  use  of  entire  feed  array  for  all  scan  directions;  5)  specified  scan  angle 
limits  for  feed  array,  dome  collector  and  dome  radiator  sides  (in  this  report  60°  was 
used  for  all  three);  6)  validity  of  ray  optics;  7)  absence  of  crossed  rays;  8)  all  of  the 
dome  surface  within  the  scan  range  of  the  feed  array  is  employed. 

The  feed  array  phasing  is  assumed  such  that  the  far  fields  radiated  by  all  ele- 
ments  always  add  coherently  in  the  desired  scan  direction.  It  then  can  be  shown  that 
the  following  upper  bound  holds  for  the  average  gain  (normalized  to  feed  array  broad- 
side gain  2ttL/\) 

tt/2 

8=F(27FL7TT  / G(Cp)dCpl  ^ sinamax 

-n/2 

where  G denotes  the  dome  antenna  gain,  Tl  the  feed  array  aperture  efficiency  and  a 

max 

is  the  maximum  permissible  feed  array  scan  angle.  Thus,  scanning  the  array  to 

a < 90°  results  in  a loss  of  average  gain  («  13%  for  a = 60°). 
max  ° ° max 
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The  scan  angle  constraint  also  leads  to  a lower  bound  on  the  normalized  dome 
phase  gradient  <j>(cP/)  = (X/2na)  di|(/dcp/,  which  in  turn  results  in  an  upper  bound  on  G(cp). 
The  refraction  through  the  dome  is  given  by  the  ray  equation 

sin(cp  - cp')  = stn(a  - cp')  + <t>(cp') 


When  a = 60°,  this  leads,  for  the  worst  case  (cp  = 90°),  to 
m dx 

4»l 7 ) > <t>rnin(cP,>  = COS  cp'  - sin  (y  - cp' ) 


(2) 


(3) 


The  function  4>rnin(cP/)  shown  in  Fig.  2 applies  only  over  a limited  sector  of  the  dome 
(40° < cp'<  80°  with  a = 0.71L),  because  rays  intercepting  the  dome  outside  this  sector 


Fig.  2.  Minimum  phase  gradient  required 
not  to  exceed  60°  scan  angle  on  the 
feed  array. 

by  geometry  alone  (i.e.,  independently  of  phase  gradient)  correspond  to  scan  angles 
< 60°.  Another  restriction  on  the  phase  gradient  arises  from  the  required  symmetry 
about  zenith  which  gives  <j>(0)  = 0. 

For  the  gain  G(cp)  to  be  maximum  in  a particular  scan  direction  cp,  requires  (see 
Fig,  1)  that  the  equivalent  aperture  l/(cp)  be  maximized,  i.e.,  4>  shall  be  such  that  the 
two  intercept  points  Pj  and  be  maximally  separated.  A systematic  use  of  the  condi- 
tions on  ij>,  together  with  the  requirements  that  the  scan  angle  off  the  dome  is  at  most 

60°,  and  that  no  crossed  rays  occur,  leads  to  an  upper  limit  function  L/  (cp)  and  a 

max'  ' 

normalized  maximum  gain  g (cp)  = L1  (cp)/L.  A minimum  gain  value  ■>  can  be 

“ “max'  ' max'T"  “ “min 

estimated  at  cp  = 90°  by  observing  that  L/(90°)  has  a lower  bound.  Figure  3 shows  the 
function  gfnax('P),  Smin(90°),  the  average  g(T|=  1,  dmax  = 60°)  and  the  normalized  upper 
limit  gp  given  by  the  projected  dome  area,  for  a dome  radius  a = 0.71L. 

It  is  noted  in  Fig.  3,  with  a = 60°,  that  there  is  relatively  little  difference 

max 

between  g and  g in  the  sector  40°  < cp  < 90°  and  between  g and  a . at  90°. 

“max  ° — — “max  “min 

This  restricts  the  amount  of  possible  variation  in  G(cp).  A larger  dynamic  range  is 
obtained  by  increasing  the  relative  dome  size  or  relaxing  the  scan  angle  limits,  as 
indicated  by  the  curves  in  Figure  4. 
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once  the  scan  angle  limits  have  been  set. 

Two  examples  secve  to  illustrate  the  above  points.  Figure  5 shows  gain-vs-scan 
curves  obtained  from  a dome  phase  gradient  chosen  so  as  io  maximize  G(67.5°).  The 


Fig.  5.  a)  Desired  gain-vs-scan  curve  g^  and 
synthesized  curve  gs,  b)  synthesized 
phase  gradient  <J>S. 

curves  closely  approach  g (67.  5°)  and  still  give  relatively  high  gain  at  cp  = 90°. 

m elX 

Ballistic  Missile  Defense  Advanced  Technology  Center 

DASG  60-76-C-0006  H.  Steyskal,  A.  Hessel  and  J.  Shmoys 

REFERENCES 

1.  L.  Schwartzman  and  J.  Stangel,  "The  Dome  Antenna,"  Microwave  Journal 
(October  1975). 

2.  J.  Stangel,  "A  Basic  Theorem  Concerning  the  Electronic  Scanning  Capabilities 
of  Antennas,  ■ URSI  Spring  Meeting  Digest  (June  1 1,  1974). 


ELECTROMAGNETICS 


47 


BLAZED  DIFFRACTION  GRATINGS  FOR  FREQUENCY  SCANNED  ANTENNAS 
A.  Hessel,  J.  Shmoys  and  S.  T.  Peng 

A.  Introduction 

Frequency  scanned  traveling  wave  arrays  are  well-known  and  have  received  much 
attention  in  applications  to  line  sources  as  well  as  to  frequency- phase  scanned  phased 
array  antennas.  A different  approach  to  frequency  scanned  phased  arrays  has  been 
proposed  by  McDonnell- Douglas  } In  this  scheme,  a Bragg  angle  blazed  (or  operated 
the  Littrow  condition)  diffraction  grating  with  one  non- specular  diffracted  order  is  used 

as  a reflect  array,  designed  to  frequency  scan  the  n=-l  grating  order,  while  the  spec- 

$ 

ular  beam  is  simultaneously  suppressed.  Alternatively,  a transmission  grating  may  be 
employed  when  blazed  for  Bragg  angle  of  incidence  in  such  a way  that  the  specularly 
reflected  (n  = 0)  beam,  as  well  as  the  transmitted  (n=0)  beam  and  the  reflected  (n  = -l) 
diffracted  order  are  simultaneously  suppressed.  In  that  case,  the  transmitted  n = -l 
grating  order  serves  as  the  frequency  scanned  main  beam.  This  study  deals  with  the 
analysis  and  design  method  of  a class  of  transmission  gratings  consisting  of  parallel 
bars  of  rectangular  cross-section.  We  will  show  that  it  is  possible  to  select  grating 
parameters  and  a direction  of  incidence  in  the  plane  perpendicular  to  the  grating  ele- 
ments (cf.  Fig.  1)  so  as  to  achieve  over  90%  blazing  efficiency  with  30-40°  scan  range 
when  the  incident  wave  is  taken  to  be  polarized  in  the  plane  of  incidence. 


INTERFACE 


INTERFACE 


Fig.  1.  Rectangular  bar  transmission  grating. 


In  a blazed  diffraction  grating,  the  scattered  power  is  concentrated  in  a particular 
spectral  order.  Blazing  may  be  achieved  by  operating  at  Bragg  angle  of  incidence 
(kd  sin  0 - it).  For  an  asymmetric  grating  profile,  blazing  can  be  attained  both  at  Bragg 
angle  and  angles  of  incidence  other  than  Bragg. 
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B.  Analysis 

Let  the  transmission  grating  of  Fig.  1 be  illuminated  in  symmetrical  fashion  by 
two  plane  waves  with  electric  field  (component  transverse  to  z-axis)  amplitudes  Vj  and 
V2-  We  wiu  consider  two  special  cases.  If  Vj  = -V  then  the  x-y  plane  (mid-plane) 
can  be  replaced  by  a perfectly  conducting  sheet  (electric  wall)  and  we  obtain  a short 
circuit  bisection  of  the  structure  (sb);  if,  on  the  other  hand,  Vj  = V^,  then  the  transverse 
to  z magnetic  field  will  vanish  and  we  obtain  the  open-circuit  bisection  (ob),  or  magnetic 
wall. 

The  two  cases  are  illustrated  in  Figs.  2(a)  and  2(b)  where  the  incident  wave  am- 
plitudes as  well  as  n-th  space  harmonic  plane  waves  are  indicated  on  both  sides  of  the 
gracing.  Both  sb  and  ob  fields  satisfy  Maxwell's  equations  and  all  the  boundary  condi- 
tions imposed  by  the  grating  structure.  Hence,  the  sum  of  the  two  fields  also  satisfies 
these  conditions.  For  the  sum  we  have,  as  shown  in  Fig.  2(c),  =2  and  V2  = 0 . Thus, 


(a)  (b)  (c) 

Fig.  2.  Incident  and  diffracted  waves. 

we  identify  T^  = (T  - T )/2  as  the  n-th  mode  transmission  coefficient  and  R = (r°^  + 
IJ,  as  the  reflection  coefficient.  On  the  other  hand,  the  amplitudes  r°band  Tsb  can 

be  interpreted  as  scattering  coefficients  of  a reflecting  grating  consisting  of  grooves  of 
depth  h (half  of  the  height  of  the  original  transmission  grating)  backed  by  an  electric  or 
magnetic  wall.  The  method  of  calculation  of  the  scattering  coefficients  of  a reflection 
grating  was  described  previously2,  3 for  the  short-circuit  case.  The  open-circuit  bi- 
section case  constitutes  a minor  modification. 
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C.  Design 

Transmission  blazing  of  a symmetric  grating  imposes  requirements  on  the  scat- 
tering parameters  of  the  grating,  consequently  on  the  reflection  characteristics  of  the 
short-circuit  bisection  and  open-circuit  bisection  of  the  structure.  For  a perfectly 
blazed  transmission  grating,  one  should  operate  in  the  propagation  range  of  only  two 
orders,  n = 0 and  n = - 1 , and  one  must  have  simultaneously 

R = T = R = 0 (1) 

o o - 1 


where  R and  T are  the  reflection  and  transmission  coefficients  for  the  n-th  order  for 
n n 

the  transmission  grating.  The  first  two  of  these  conditions  are  satisfied  if,  and  only  if. 


U) 


or,  if  both  the  mid-plane  short-circuited  reflection  grating  and  the  mid-plane  open- 
circuited  one  are  simultaneously  blazed.  Thus,  we  should  look  for  a reflection  grating 
which  is  blazed  over  a large  range  of  depth,  i.e.,  blaze  depth  vs.  incidence  angle 
characteristic,  such  as  that  of  Fig.  3,  which  is  nearly  vertical.  The  cancellation  of 


ANGLE  OF  INCIDENCE 

Fig.  3.  Design  curves  for  Bragg  blazed  transmission  grating. 
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n = 0 spectral  orders  both  in  reflection  and  transmission  is  predicated  on  the  existence 
of  the  short-circuit  and  open-circuit  bisections,  i.e.,  on  reflection  symmetry  and  on 
the  existence  of  the  steep  portions  of  the  blazing  curves.  For  a/d  .>0.8  such  charac- 
teristics are  indeed  obtained  and  the  mechanism  responsible  for  this  shape  is  the  inter- 
ference between  two  propagating  parallel  plate  guide  modes.  The  approximate  cancel- 
lation of  the  n =- 1 grating  mode  is  predicated  on  the  and  T*  being  approximately 

180°  out  of  phase  at  the  blazing  condition,  and  by  power  conservation  ) F°^  | = | F | . 
The  r0jb  = -rSb  condition  may  indeed  by  realized  at  the  Bragg  blazing  condition  when 
the  parallel  plate  guide  free  space  interface  discontinuity  can  be  made  small,  i.e.,  for 
thin  plates  and  TM  polarization.  In  this  case  very  little  energy  is  back- scattered  from 
the  junction  discontinuity  and  the  dominant  reflection  is  due  to  the  wave  entering  the 
waveguide  and  reflected  from  the  short-circuit  or  open-circuit  respectively  at  the  bot- 
tom of  the  groove.  But  the  reflection  coefficients  for  both  of  the  propagating  modes 
from  the  open-circuit  are  +1  while  from  the  short-circuit  termination  they  are  -1.  The 
reflected  parallel  plate  modes,  after  passing  the  discontinuity  with  a minor  effect,  re- 
radiate via  the  TS^  or  r°jb.  But,  since  the  two  reflections  from  sb  and  ob  terminations 
are,  as  stated,  out  of  phase,  TS^  and  are  approximately  so. 

In  the  TM  case  studies  were  carried  out  for  a/d  = 0.76,  0.81,  0.86  and  0.91. 
Some  of  the  results  are  presented  in  Figures  3 to  7.  The  ratios  chosen  were  removed 
from  ratios  of  small  integers  in  order  to  avoid  coincidence  between  transverse  wave- 
numbers  of  space  harmonics  and  waveguide  modes  (such  a coincidence  would  result, 
in  the  computer  program  as  it  is  written,  in  an  indeterminate  expression,  0/0).  In 
each  case,  the  Bragg  angle  blazing  characteristics  for  both  the  short-circuit  bisection 
and  open-circuit  bisection  of  the  transmission  grating  were  obtained  (cf.  Figure  3). 

The  areas  near  the  sb  and  ob  intersections  showed  high  conversion  efficiency  into  the 
transmitted  (-1)  diffraction  order.  This  is  not  unexpected  because  the  two  conditions 
(1)  are  satisfied  exactly  at  these  intersections.  A number  of  "operating  points,"  i.e., 
values  of  h/d  and  angle  of  incidence  were  chosen.  These  are  shown  in  Figure  3.  Suc- 
ceeding graphs  show,  as  a function  of  frequency,  the  performance  of  gratings  whose 
dimensions  are  in  accord  with  the  location  of  the  operating  points.  The  direction  of  the 
diffracted  wave  is  used  as  the  abscissa,  rather  than  frequency,  because  the  grating  is 
to  be  used  as  a scanning  device  and  the  angle  is  of  primary  importance.  The  ordinate 
in  these  graphs  is  the  power  level  of  the  n = -l  transmitted  wave  as  compared  with  that 
of  the  incident  wave,  expressed  in  dB.  The  values  of  frequency  shown  at  various  points 
along  each  curve  are  in  GHz  for  a 1 cm  grating  period;  for  other  periods  they  should  be 
scaled  appropriately.  Maximum  efficiency  occurs  near  the  angle  coordinate  of  the 
operating  point  because,  at  Bragg  condition,  the  angles  of  incidence  and  diffractions  are 
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Transmission  loss  of  rectangular  bar  Fig.  5.  Transmission  loss  of  rectangular  bar 
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numerically  equal.  Subsidiary  peaks  appear  to  be  related  to  neighboring  ovals  of  high 
efficiency  (cf.  Figure  3).  Figure  4 shows  typical  performance  of  a grating  designed 
according  to  one  of  the  intersections  of  short-circuit  and  open-circuit  curves  of  Figure 
3.  The  effect  of  a small  change  in  the  angle  of  incidence  is  shown,  for  a grating  with 
somewhat  thinner  bars,  in  Figure  5.  Efficiency  over  the  pass-band  changes  little,  but 
the  angular  range  is  displaced.  The  thinner  the  grating,  the  broader  the  angular  range 
of  its  usefulness.  This,  again,  appears  to  be  related  to  the  spacing  of  the  ovals  and  to 
the  fact  that  the  thinner  the  wall  the  smaller  the  effect  of  the  junction  discontinuity. 

The  resulting  improved  performance  is  shown  in  Figures  6 and  7.  In  these  cases  the 
bar  thickness  is  9%  of  the  grating  period,  and  remaining  parameters  correspond  to  two 
intersections  of  open-  and  short-circuit  blazing  curves.  Less  than  1 dB  conversion 
loss  (not  considering  ohmic  losses)  can  be  achieved  over  an  angular  scan  range  of  more 
than  30°. 

Limited  studies  were  carried  out  for  polarization  parallel  to  grating  elements 
(TE  case),  and,  in  the  TM  case,  for  a grating  of  rectangular  dielectric  bars.  In  both 
these  cases  very  high  conversion  efficiency  was  obtained  for  gratings  designed  using 
the  procedure  described  above,  but  the  bandwidth  and  the  useful  angular  scan  range  was 
found  to  be  much  more  restricted. 
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PROPERTIES  OF  THE  SHADOW  CAST  BY  A HALF-SCREEN  WHEN  ILLUMINATED 
BY  A GAUSSIAN  BEAM 

A.C.  Green,  H,  L.  Bertoni  and  L,  B,  Felsen 
A.  Introduction 

Optical  components  such  as  irises,  occulting  knife  edges  and  stray  light  baffles 
are  designed  for  their  shadow  producing  properties.  For  simple  fields  that  appear  to 
diverge  from  a point  in  the  three  dimensional  case,  a line  in  the  two  dimensional  case, 
or  a pair  of  perpendicular  lines  in  the  astimatic  three  dimensional  case,  the  location 
of  the  shadow  boundary  and  the  nature  of  the  fields  away  from  the  shadow  boundary  can 
be  found  using  the  geometrical  theory  of  diffraction  (GTD).  Such  simple  divergent 
fields  can  be  described  in  terms  of  rays  along  which  the  fields  propagate.  The  rays 
that  graze  the  edge  of  the  shadowing  obstacle,  which  we  call  critical  rays,  define  the 
shadow  boundary.  Treating  the  critical  rays  as  if  they  were  reflected  defines  the 
shadow  boundary  of  the  reflected  field.  Finally,  the  critical  rays  can  be  viewed  as 
generating  the  diffracted  rays  that  are  the  source  of  the  fields  in  the  shadow  region. 

The  simple  distinction  between  incident,  reflected  and  diffracted  fields  is  not  valid 
within  transition  regions  centered  about  the  shadow  boundaries.  Within  these  regions, 
a uniform  theory  of  diffraction  must  be  employed,  which  gives  the  continuous  variation 
of  the  field  across  the  shadow  boundary. 

Optical  fields  are  however  commonly  in  the  form  of  bounded  beams,  which  can- 

1 2 

not  be  described  in  terms  of  real  rays.  ’ In  the  case  of  Gaussian  beams,  the  fields 
can  be  described  in  terms  of  rays  emanating  from  a source  point  having  complex  co- 
ordinates. These  rays  must  be  viewed  as  traveling  through  complex  space  and  inter- 
secting real  space  at  a single  point.  Because  each  of  these  complex  rays  illuminates 
a single  point  in  real  space,  the  critical  rays  that  "graze"  the  shadowing  obstacle  can- 
not define  the  shadow  boundaries  of  the  regions  illuminated  by  the  incident  or  reflected 
fields. 

The  complex  ray  description  of  a Gaussian  beam  vas  previously  used  to  find  the 
diffracted  fields  generated  by  an  iris.'*  However,  the  shadow  boundaries  were  located 
on  the  basis  of  an  ad  hoc  assumption,  rather  than  a rigorous  analysis,  and  no  considera- 
tion was  given  to  the  shape  of  the  transition  regions,  or  to  the  field  variation  within 
them.  For  the  case  of  an  inhomogeneous  plane  wave  incident  on  a half-plane,  the 

location  of  the  shadow  boundary  was  found,  as  well  as  the  shape  of  the  transition  regions 

4 5 

and  the  variation  of  the  fields  within  them.  ’ 

In  this  study,  we  examine  the  location  of  the  shadow  boundaries  that  result  when 
a two-dimensional  or  ribbon  beam  having  Gaussian  profile  is  incident  on  a perfectly 
conducting  half-plane,  as  shown  in  Figure  1.  The  concept  of  a shadow  boundary  is 
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associated  with  the  high  frequency  asymptotic  approximation  of  the  total  field,  which 
separates  the  field  into  incident,  reflected  and  diffracted  components.  We,  therefore, 
start  with  an  exact  integral  representation  for  the  total  field,  and  approximate  the 
integral  asymptotically. 

Except  when  the  beam  axis  passes  through  the  edge,  the  shadow  boundary  of  the 
incident  (reflected)  beam  field  is  shown  to  be  a Stokes  line  of  the  integral  representa- 
tion. As  a result,  the  incident  (reflected)  field  is  of  exponentially  smaller  order  there 
than  the  diffracted  field.  Since  the  fields  are  in  fact  continuous  across  the  shadow 
boundary,  taking  the  incident  (reflected)  field  to  exist  on  one  side,  but  not  on  the  other, 
introduces  an  error  that  is  on  the  order  of  the  incident  (reflected)  field,  and  hence  of 
exponentially  small  order  compared  to  the  diffracted  field.  Unlike  the  case  of  a real 
line  source,  or  when  the  screen  edge  lies  on  the  beam  axis,  the  diffracted  fields  pro- 
duced by  a beam  are  finite  and  continuous  across  the  shadow  boundaries.  Near  the 
screen  edge,  the  shadow  boundaries  for  the  incident  beam  also  coincide  with  those  of 
an  inhomogeneous  plane  wave,  although  they  deviate  far  from  the  edge.  As  the  edge 
approaches  the  beam  axis,  the  shadow  boundary  approaches  that  of  a real  line  source 
located  at  the  center  of  the  beam  throat. 

The  asymptotic  approximation  that  allows  separation  of  incident,  reflected  and 
diffracted  field  components  is  valid  outside  of  some  transition  regions.  The  size  and 
shape  of  these  regions  is  dependent  on  the  accuracy  stipulated  for  the  approximation, 
and  on  the  beam  width  and  edge  location.  Within  the  transition  regions,  a uniform 
asymptotic  expression  must  be  used  for  the  field.  In  the  case  of  a real  line  source 
fields,  the  transition  regions  extend  to  infinity  and  are  centered  about  the  shadow 
boundaries.  By  contrast,  it  is  shown  here  that  the  transition  regions  for  the  incident 
beam  are  not  centered  on  the  shadow  boundaries,  except  when  the  beam  axis  passes 
through  the  screen  edge.  Moreover,  if  the  edge  is  located  far  enough  from  the  beam 
axis,  the  transition  regions  do  not  extend  to  infinity,  but  are  instead  bounded,  as  is 
the  case  when  an  inhomogeneous  plane  wave  is  incident.  When  the  edge  is  near  enough 
to  the  beam  axis,  the  transition  regions  extend  to  infinity.  If  the  edge  is  located  in  the 
far  field  region  of  the  beam,  the  shadow  boundaries  and  transition  regions  approach 
those  of  a line  source  centered  at  the  beam  throat. 

B.  Formal  Solution  for  the  Diffracted  Fields 

It  has  previously  been  shown  that  the  free-space  Green's  function  which  is  an 
exact  solution  to  Maxwell's  equations,  gives  the  fields  of  a Gaussian  beam  when  complex 
values  are  assigned  to  the  source  coordinates.1,2  Starting  with  the  point-source 
Green's  function,  a Gaussian  beam  with  circular  cross-section  is  obtained.  Using  the 
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line-source  Green's  function  gives  a ribbon  beam  whose  profile  is  Gaussian.  The  ad- 
vantage of  using  the  Green's  function  to  obtain  a beam  field  is  that  t .pressions  derived 
for  the  propagation  and  diffraction  of  point-source  or  line -source  fields  can  be  used  to 
study  the  propagation  and  diffraction  of  beams. 

For  this  study  we  consider  a two-dimensional  or  ribbon  beam  propagating  parallel 
to  the  z-axis  with  no  variation  along  y,  as  depicted  in  Figure  1.  If  the  electric  field  is 
polarized  along  y,  the  beam  field  (_£;_£')  are  obtained  from  the  free-space  Green's 
function  for  the  current  line  source 


l-l o 


* 6(x-x')  6(z-z')e  ''u>t 


(1) 


with  complex  source  coordinates 


z'  = - z + ib 
o 


(2) 


In  Eq.  (1),  6(x-x')  is  the  delta  function,  u»  the  radian  frequency  and  the  permeativity 
of  free-space.  The  electric  field  generated  by  the  source  is 

1^/1,131  D - l(x+  '** 


E.(£;£')  =^HoU,(kR);  R = xq)  + (z  + zq  - ib) 


(3) 


tx 


Fig.  1.  Gaussian  beam  equivalent  to  the  field  radiated  by  a line 
source  at  the  complex  location  (-x,  -zQ+ib).  The  beam 
is  indicated  by  phase  paths  along  which  the  exponential 
amplitude  of  the  field  is  constant. 
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where  (kR)  is  the  zero  order  Hankel  function  of  the  first  kind,  and  k = w/C  is  free- 
space  wavenumber.  The  complex  displacement  R has  branch  point  singularities  at 
x=  - xq  ± b,  z = -z^  where  R=  0.  In  order  for  R,  and  hence  E.  (_£;_£')  to  be  single 
valued,  it  is  necessary  to  connect  the  branch  points  with  a cut,  such  as  the  one  shown 
in  Figure  1.  The  field  is  discontinuous  acorss  the  cut,  which  requires  the  presence 
sources.  Viewed  another  way,  by  placing  the  proper  source  distribution  along  the  cut, 
the  radiated  fields  will  be  those  of  Equation  (3).  For  the  choice  of  cut  in  Fig.  1, 
Re(R)>0  and  Eq.  (3)  represents  an  outgoing  wave  for  p-»oo. 

Far  from  the  branch  points,  k ( R | » 1 and  the  Hankel  function  may  be  replaced 
by  its  asymptotic  approximation  so  that 


E.  ( p ; pi)  - ex£_[i(kR+  tt/4)  ] 
1 2 J 2TTkR 


(4) 


2 2 2 

In  the  paraxial  region  (x+xq)  « (z+zq)  +b  and  we  may  approximate  R in  Equation 

(4).  For  z>  - z we  obtain 
o 


Ej(p;  p>)  - 


exp  [kb  + ik(z-zQ)  + in/4] 


2 J 2rr  Vk(z  + zq-  ib) 


exp 


z + z 


(1  -i 


(5) 


whe  re 


w =,/2 


(z+  zo)2  + b2 


kb 


(6) 


Expression  (5)  is  the  field  of  a Gaussian  beam  whose  l/e  half-width  is  w.  For  z + z ->  0 

—T—  ° 

the  half-width  is  seen  to  be  J 2b/ k so  that  the  parameter  b represents  the  classical 

distance  to  the  far  field.  In  the  Fraunhoffer  or  far  field  region  where  z + z » b.  the 

o ’ 

beam  diverges  and  appears  to  come  from  a line  source  located  at  the  center  ( -x  . - z ) 

o o 

of  the  beam  throat. 

The  field  Eq.  (3)  is  assumed  to  be  incident  from  the  left  on  a conducting  half- 
screen occupying  the  half-plane  z=0,  x <0,  as  shown  in  Figure  1.  An  expression  for 
the  total  field  radiated  by  a real  line  source  in  the  presence  of  the  screen  has  previously 

6 7 r 7 

been  derived.  ’ Letting  the  source  have  complex  location,  we  obtain  the  total  field 
* 

for  the  beam  case  as 


E(p;£.)  = C*P^>  1+ 


exp(ikR  ) 
2TT 


(7) 


This  result  is  obtained  from  Ref.  [6]  with  the  substitution  p = t exp(irt/4). 
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f-J 


-t 


dt 


W*  ^t2  -i2kR* 


(8) 


In  Eqs.  (7)  and  (8),  R and  R are  the  complex  distances  to  the  observation  point  from 
the  source  at  (-xq,  -z^  + ib)  and  from  the  image  source  at  (~xq>  ZQ-ib),  respectively. 
These  distances  are  given  by 


R " = /(x  + x )2+  [ z T ( - z + ib)  ] 
\ o o 


(9) 


± ± 

where  the  root  is  chosen  such  that  Re(R  )>  0.  The  end  points  W of  the  integration 
are  defined  below. 

Let  ( p , 0)  with  - TT  < 0 < TT  be  the  polar  coordinate  of  the  observation  point  and 
(p1,  0')  be  the  polar  coordinates  of  the  source  given  by 


0'  = tan 


-1 


z - ib 
o 


,=  „/7*7 


p,=  V Xo  + (zo  ” ib) 


(10) 


with  Re(p')  > 0 and  0<  Re(0')  < TT.  The  angles  0 and  O'  are  indicated  in  Fig.  2(a)  for 
a real  line  source  (b  = 0). 

Imt 


Ret 


(a) 

Fig.  2.  (a)  Polar  coordinate  system  showing  source  co- 

ordinates for  a real  line  source,  (b)  Integration 
path  and  branch  points  of  the  integral  representa- 
tion for  the  total  field. 
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The  end  points  W of  the  integrations  in  Eq.  (8)  are  given  in  terms  of  the  polar 
coordinates  as 


W*  = 2e-iTT/4  /■■  kPP'  , 

J p+  p'  + R' 


- sin  j (0  - 0') 


cos  Y (9  + 6') 


W~  = - e "lTT/4  k(  p + p'  - R^  ) 


For  a complex  source,  W “ = 0 only  at  the  screen  edge  p = 0,  so  that  W is  single  valued 
in  the  domain  -Tt/2<  8<  3tt/2.  The  choice  of  square  root  in  Eqs.  (11a)  or  (lib)  is 
made  so  that  Re  W >0  for  p -*  with  0 ->  - tt/2,  since  as  argued  in  the  next  section 
only  the  diffracted  field  reaches  the  far  side  of  the  screen. 

Steepest  descent  paths  for  the  integrations  in  Eq.  (7)  are  indicated  in  Fig,  2(b) 
for  the  case  (A)  when  Re  (W")  > 0 and  case  (B)  when  Re  (W-)  < 0,  The  branch  points 
- e*n/4^/2kR*  and  hill  regions  of  the  integrands  are  also  indicated  in  the  figure.  For 
later  use,  we  may  express  R in  terms  of  the  polar  coordinates  as 

R"  = J P2  t 2 pp1  cos(6  T 0')  + (p')Z  . (12) 

C,  Asymptotic  Approximation  for  the  Fields  and  the  Shadow  Boundary 

In  the  geometrical  theory  of  diffraction  (GTD),  the  total  field  E(_p,_£')  is  separated 
into  a geometrical  optics  component  EGQ(£.  p')  and  a diffraction  component  ED(_p,  £>') 
as 

£(£;_£')  = Ed(£;  _g')  + EGO(f,  fp7)  (13) 

The  geometrical  optics  component  consists  of  the  incident  and  reflected  fields,  which 
exist  in  regions  of  space  bordered  by  shadow  boundaries.  The  diffraction  component 
includes  the  remainder  of  the  field. 

Concepts  such  as  the  shadow  boundary  that  are  basic  to  GTD  arise  out  of  the 
asymptotic  approximation  for  the  field  Equation  (7).  The  asymptotic  approximation 
for  the  integral  in  Eq.  (8)  may  be  obtained  by  taking  the  integration  path  to  be  the 
steepest  descent  path  away  from  the  end  point  W . For  Re(W  ±)  > 0 the  steepest  descent 
path  runs  to  t = + oc  asymptotic  to  the  real  t axis,  as  shown  by  path  (A)  in  Figure  2(b). 
When  Re(W~)<  0,  the  steepest  descent  path  runs  asymptotic  to  the  negative  real  t axis. 
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In  this  case  >t  is  necessary  to  return  to  the  end  point  t=+  oo  via  the  saddle  point  at  t=  0, 
as  shown  by  path  (B)  in  Figure  2(b).  Thus  the  saddle  point  contributes  to  Eq.  (8)  for 
Re(W  “)  < 0,  but  not  fcr  Re(W~)  > 0. 

Let  0gi(p)  he  the  locus  of  points  for  which  Re(W  + ) = 0,  and  0 r(p)  be  he  locus 

points  for  which  Re(W ')  = 0.  For  8>  0 (p),  Re(W  + ) < 0 and  the  saddle  point  contributes 
+ . . - 

to  I . Similarly,  for  0 > 0 (p),  Re(W  ) < 0 and  the  saddle  point  contributes  to  i". 

b 1 / 

The  saddle  point  contributions  can  be  expressed  in  terms  of  Hankel  functions,  and 

are  found  to  allow  their  interpretation  as  being  the  geometrical  optics  contribution  to 

the  field.  Thus, 

= u[0-0si(p)]Ho1)(kR+)  - u[0-0sr (p)lHo1)(kR’)  . (14) 

where  the  first  term  gives  the  incident  field  and  the  second  term  gives  the  reflected 
field.  The  locii  of  points  p ) and  0sr<P)  are  seen  to  be  the  shadow  boundaries  of 
the  incident  and  reflected  fields,  respectively. 

In  the  case  of  a real  line  source  (b  = 0),  0'  and  R~  are  real  and  hence  from  Eq. 

(11)  Re(W  + ) = 0 when  sin|-  ( 0- 0 ’)  = 0,  while  Re(W')  = 0 when  cos^-(0+e')  = 0.  Thus 
9gir  6'  and  0gr  = TT  - 0 , so  that  the  two  shadow  boundaries  are  straight  lines  originating 

at  the  edge,  and  are  symmetrically  located  about  the  x-axis.  The  shadow  boundaries 
coincide  with  the  incident  and  reflected  rays  that  pass  through  the  edge. 

For  a line  source  located  at  a complex  point,  0q  and  R±  are  complex  so  that 
0si(p)  and  ®sr(P)  do  not  have  a simple  geometrical  interpretation,  although  we  still 
have  that  0sJ.(p)  = TT  - 0si(  P ) . In  order  to  locate  the  shadow  boundary  0si(p),  note 
that  if  Re(W^)  = 0,  then  arg(W  ) = TT.  With  the  help  of  Eq.  (lib)  this  last  condition 

is  seen  to  require  that  Re(  p + p'  - R+)  = 0 with  Im(  p+  p'  - R+)  < 0.  Simple  expressions 
for  0^(  p)  are  obtained  in  the  limits  p « | p'  | and  p » | p'  | . 

For  p « | p' | we  approximate  R+  From  Eq.  (12)  as 

R+  p'  + p cos  (0  - 0')  (15) 

and  obtain  for  the  shadow  boundary  0^ ^ ( 0)  near  the  edge  the  expression 
tan  [0 ^(0)  -u]  = sinhv 

where 

u = Re(0')1 
v = lm(0')  I 


. 


(16) 


(IT) 
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The  location  Eq,  (16)  is  the  same  as  was  previously  found  for  an  inhomogeneous  plane 
wave  incident  on  the  edge  when  the  wave  vector  of  the  plane  wave  makes  the  (complex) 
angle  01  with  the  z'-axis.  In  the  vicinity  of  the  edge,  the  incident  field  radiated  by 
the  complex  line  source  is  locally  that  of  an  inhomogeneous  plane  wave  whose  wave 
vector  V(kR)  makes  the  angle  0'  with  the  z-axis,  thus  accounting  for  the  location  of 
the  shadow  boundary  in  the  vicinity  of  the  edge. 


n 2 

Assuming  the  edge  to  be  in  the  paraxial  region  of  the  incident  beam  (xq  <K  ^/zq  +b  ), 
it  is  seen  from  Eq.  ( 1 0)  that  | 0 1 1 « 1 and  hence 


u » Re 


/ x \ 
(r^tr) 


X z 
o o 

~z~2 

z + b 
o 


x b 


v * Im  (r~r n?) = “z2-? 


z + b 
o 


(18) 


For  | v J « 1,  tan  (sinh  v)  « v so  that  from  Eq. 


(16) 


x (z  + b) 

eai(°)  es  u + v = - °~y°-  j > (19) 

z + b 
o 

which  is  the  angle  the  shadow  boundary  makes  with  the  z-axis  for  p«  | p1 1 . The  shadow 
boundary  is  indicated  in  Fig.  3 for  the  case  xq  > 0,  which  occurs  when  the  edge  blocks 
the  beam  axis.  When  the  edge  lies  on  the  beam  axis,  the  shadow  boundary  coincides 
with  the  beam  axis. 

For  p » | p'  | , i.  e.  , far  from  the  edge,  we  may  approximate  R+  from  Eq.  (12)  as 
R+  « p + p1  cos(0  - 0') 

and  obtain  for  the  shadow  boundary  0 (oo)  the  expression 

tanl~0  .(oo)  -ul=  sinh  v fi.e li1 . ) (21) 

S1  | p'  | + Im(  p')  cosh  v 

The  location  of  the  shadow  boundary  9sj(°°)  f°r  p » | p’ | depends  strongly  on  the  location 
of  the  edge  within  the  beam  through  the  factor  multiplying  sinhv  in  Equation  (21). 

2 2 2 

In  the  paraxial  region  xq  « zq  + b , and  From  Eq.  (10)  we  have 


(22) 
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Fig.  3.  Shadow  boundaries  o£  the  incident  and  reflected  fields, 
so  that  Im(p')  < 0.  Because 

Re(  p1)  + |lm(  p')  | cosh  v > Re(  p')  + |lm(  p ')  | > | p'  | (23) 

it  is  seen  that  for  Im(p')  < 0, 

Re(p')  > | p' | + Im(p')  cosh  v . (24) 

Thus  the  factor  multiplying  sinhv  in  Eq.  (21)  will  be  greater  than  unity.  Comparing 
Eqs,  (16)  and  (21)  we  see  that 

0si<°°>  > 0si<°>  (25) 

when  the  edge  is  located  in  the  paraxial  region.  As  a result,  the  shadow  boundary  turns 
away  from  the  z-axis  for  p -»<*,  as  is  shown  in  Figure  3.  Should  the  denominator  of 
Eq.  (21)  vanish,  then  0^(00)  -u  = Tt/2  and  shadow  boundary  will  run  to  p -» 00  at  angle  u 
from  the  x-axis.  Consistent  with  paraxial  approximation  Eq.  (22),  the  denominator 
vanishes  for  | xq  | » z^  « b,  i.e.,  when  the  edge  is  located  in  the  near  field  along  a 
line  at  45  from  z-axis.  For  edge  locations  above  this  45°  line,  the  shadow  boundary 
turns  further  behind  the  x-axis  into  the  region  z<  0. 
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The  location  of  the  shadow  boundary  0 .(p)  is  shown  in  Fig.  3 for  the  case  x > 0 
so  that  edge  blocks  the  main  part  of  the  beam.  For  xq  < 0,  the  shadow  boundary  is 
located  symmetrically  below  the  z-axis.  For  comparison,  we  have  also  drawn  in 
Fig.  3 the  curve  passing  through  edge  along  which  Im  p'  has  constant  value.  This  curve 
represents  a contour  along  which  the  exponential  amplitude  of  the  incident  beam  is 
constant.  When  the  edge  is  located  in  the  paraxial  region  of  the  beam,  the  curve  has 
slope 


X z 
O Q 

z 2 + b' 
o 


= u ( p ->  0) 


z + b 
o 


(p  -.00) 


The  slope  for  p -* « has  magnitude  smaller  than  |u+v|,  as  indicated  in  Figure  3. 

It  is  seen  that  the  shadow  boundary  curves  into  a region  where  the  incident  beam 
field  is  exponentially  small  compared  to  its  value  at  the  edge.  Since  the  diffracted 
field  Ed  everywhere  has  exponential  order  equal  to  the  order  of  the  incident  beam  at 
the  edge,  along  the  shadow  boundary  EQ  is  exponentially  larger  than  the  incident  field 
EGO'  Thus>  as  the  shadow  boundary  is  crossed,  a discontinuity  in  the  field  is  obtained 
whose  magnitude  is  equal  to  I Eqq | . which  is  exponentially  smaller  than  the  diffracted 
field  there.  In  the  literature  on  asymptotic  analysis,  this  behavior  is  referred  to  as 
the  Stokes  phenomena  and  the  shadow  boundary  is  referred  to  as  a Stokes  line. 

The  properties  of  the  shadow  boundary  cited  above  are  in  accordance  with  those 
found  for  an  inhomogeneous  plane  wave  blocked  by  an  edge.  Near  the  edge  the  shadow 
boundary  has  the  same  location  as  would  be  obtained  for  the  inhomogeneous  plane  wave 
that  approximates  the  incident  beam  field  in  the  vicinity  of  the  edge.  However,  far 
from  the  edge  the  shadow  boundary  is  shifted,  indicating  that  its  location  cannot  be 
predicted  solely  from  the  properties  of  the  field  in  the  vicinity  of  the  edge,  but  will 
also  depend  on  the  properties  of  the  field  far  from  the  edge. 

D.  Diffracted  Field  and  the  Transition  Region 

The  diffracted  field  ED(£;  p')  is  given  by  the  integration  in  Eq.  (8)  over  the  path 
segment  from  the  end  point  W*  to  |t|  -*».  These  segments  are  indicated  in  Fig.  2(b) 
as  (A)  for  Re  Vf±>  0 and  as  (B-l)  for  Re  W 1 < 0.  For  |w|  » 1,  these  integrations  can 
be  approximated  asymptotically  by  the  method  of  steepest  descent.  Retaining  the  first 
two  non-vanishing  terms  in  the  asymptotic  approximation  gives 


I 


ED(p;  p«)  - E.(0;  p1) 
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,i(kp  + tt/4) 


J 2nkp 


- cos(0-t 


where 


+ 

A'  = 


i(P+  P’) 

2k pp* [1  t cos(e+  e')] 


In  Eq.  (27),  E.(0,  p')  is  the  field  incident  on  the  edge,  as  given  by  Eqs.  (4)  or  (5)  with 
R=  p'. 

Neglecting  the  terms  A in  Eq.  (27)  yields  the  geometrical  theory  of  diffraction 

(GTD)  approximation  EGTD  for  the  diffracted  field.  The  expression  for  EGTD  is  the 

same  as  was  previously  obtained  for  an  inhomogeneous  plane  wave  propagating  at  a 

complex  01  and  incident  on  the  edge.  The  terms  A represent  corrections  to  E__--. 

i GTD 

Thus,  the  accuracy  of  E^^.^  is  determined  by  |A  |, 

For  a real  line  source  01  is  real  and  the  field  is  singular  across  the  shadow 
boundaries.  Because  0'  is  complex  for  a beam,  the  field  is  never  singular.  How- 
every  | A+ | may  be  significant  compared  to  unity  in  the  vicinity  of  the  shadow  boundary 
for  the  incident  field.  Similarly  |A  | may  be  significant  in  the  vicinity  of  the  shadow 
boundary  of  the  reflected  field.  In  those  regions  of  space  where  A*  does  have  a signi- 
ficant magnitude,  the  asymptotic  approximation  cannot  be  used  and  a more  accurate 
evaluation  of  the  integrals  I*  is  required.  By  analogy  to  the  case  of  a real  line  source, 
these  regions  in  space  are  called  transition  regions  where  the  GTD  expressions  for 
the  diffracted  field  are  not  valid. 

The  transition  regions  can  be  defined  as  the  locations  in  space  where  the  rela- 
tive error  made  in  using  EqTD  for  Eq  has  magnitude  less  than  some  value  l/A,  where 
A is  a large  number.  Then  the  transition  regions  are  defined  by 

I A±|  > l/A  . (29) 

The  boundary  of  the  transition  region  is  obtained  from  the  equality  in  Equation  (29). 

Using  Eq.  (28)  in  Eq.  (29),  we  obtain  for  the  boundary  of  th-*  incident  field's 
transition  region  the  equation 

| p + p'|  A = 2kp  | p'|  | 1 - cos(0  - 0 ')  | . (30) 

Substituting  01  = u + iv  and  squaring  both  sides  gives 
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2 2 

(p2+2pRep'  + | p' | 2)  = I I — p2 -^[l  - cos(G-u)  coshv]2+ sin2(0-u)  sinh2v  | , 

(31) 


whose  solution  for  p as  a function  of  9 gives  the  boundary  of  the  transition  region.  It 
is  seen  that  the  transition  region  is  symmetric  about  the  radius  9 = u extending  from 
the  edge.  To  find  the  maximum  distance  that  the  transition  region  extends  away  from 
the  edge,  set  9=u  in  Eq.  (31)  and  solve  the  resulting  quadratic  for  p.  Since  Re  p*  > 0 
the  quadratic  will  have  a real  solution  with  p > 0 only  if 


(32) 


If  the  inequality  Eq.  (32)  is  not  satisfied,  no  positive  real  solution  will  exist  for  p, 
indicating  that  the  transition  region  extend  to  p = oo,  as  happens  for  a real  line  source. 

The  case  of  the  transition  region  extending  to  p = oo  is  shown  in  Fig.  4,  while  a transition 
region  of  finite  extent  is  depicted  in  Figure  5. 


Fig.  4.  Transition  region  about  the  incident-field 
shadow  boundary  for  the  case  when  the 
transition  region  extends  to  p->oo. 

When  the  edge  is  in  the  paraxial  region  of  the  beam,  | v | « 1 and  |p'  |2«z2  + b2 
so  that  with  the  help  of  Eq.  (18),  condition  (32)  for  the  transition  region  to  be  finite 
extent  becomes 
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Fig,  5.  Transition  region  about  the  incident-field  shadow 
boundary  for  the  case  when  the  transition  region 
is  of  finite  extent  along  the  radius  0 = u. 


* bV/4 

b 


(33) 


Let  us  compare  | | to  the  l/e  half-width  w of  the  beam  at  the  edge,  which  is  given 

by  (6)  with  z = 0.  Thus,  Eq.  (33)  becomes 


+ b2) 


1/4 


(34) 


In  order  to  understand  the  significance  of  Eq.  (34),  first  suppose  the  edge  is 

located  near  the  throat  of  the  beam  so  that  z « b.  Condition  (34)  then  reduces  to 

° 

lxJ/w  > v/A/2.  which  is  satisfied  for  nominal  values  of  A if  the  edge  is  outside  of 

the  main  portion  of  the  beam.  For  this  location  of  the  edge,  the  transition  region  has 

finite  radial  extent  away  from  the  edge,  as  depicted  in  Figure  5.  However,  for  | x | -* 

condition  (34)  is  not  satisfied  for  any  value  of  A and  the  transition  regions  extends  to 

P as  in  Figure  4.  The  discussion  given  above  for  zq  « b also  holds  qualitatively 

for  z - b. 
o 

For  zQ»b,  condition  (34)  becomes  |x^|/w  > J~Az~/Zb  and  is  satisfied  only  for 
xq  much  larger  than  the  local  beam  width  w.  In  this  case  the  field  incident  on  the 
edge  E.(0;_g'),  and  hence  the  diffracted  fields  are  much  smaller  than  the  field  on  the 
beam  axis.  When  |xq|/w<  J z~A/2t>,  the  transition  regions  extend  to  p-*oo,  as  in 
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Figure  4.  For  zu»b,  the  shadow  boundary  0^  approaches  u w tan  \xo/z^).  Thus 
the  transition  region  becomes  centered  about  the  shadow  boundary,  which  is  the  ex- 
tension of  the  straight  line  from  the  beam  center  (-xq,  -zq)  passing  through  the  edge 
This  behavior  is  the  same  as  would  be  obtained  for  a line  source,  which  is  to  be  ex- 
pected for  zq  "%>  b since  the  edge  is  then  located  in  the  far  field  region  of  the  source 
where  the  radiated  fields  are  those  of  a line  source  multiplied  by  a slowly  varying 
pattern  function. 

When  the  transition  region  does  extend  to  p it  occupies  a region  of  finite 

angular  width  for  p-*^>.  The  width  can  be  found  from  Eq.  (31)  by  retaining  only  the 
highest  powers  in  p.  Using  small  argument  expansions  for  the  trionometric  and 
hyperbolic  functions,  it  is  found  that  the  angular  width  A8  of  the  transition  region  is 


Depending  on  the  location  of  the  edge,  the  transition  region  may  cover  the  shadow 
boundary  and  the  beam  axis. 

The  foregoing  discussion  of  the  transition  region  about  the  shadow  boundary  of 
the  incident  field  also  holds  for  the  reflected  field  transition  region.  The  reflected 
field  transition  can  be  bound  by  inverting  the  incident  field  transition  region  through 
the  plane  z = 0. 
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SCATTERING  OF  SURFACE  WAVES  BY  A DIELECTRIC  STEP  DISCONTINUITY: 
OBLIQUE  INCIDENCE  CASE 

J.P.  Hsu,  S.  T.  Peng  and  A.  A.  Oliner 

A step  discontinuity  between  two  uniform  dielectric  waveguides  occurs  naturally 
in  many  problems  of  current  interest.  For  example,  a dielectric  stripline  for  either 
millimeter  wave  or  integrated  optics  applications  may  be  viewed  as  consisting  of  two 
step  discontinuities  at  the  side  walls,  and  a guided  wave  may  be  viewed  as  being  reflect- 
ed back  and  forth  by  the  step  discontinuities  as  it  propagates  along  the  stripline,  as 
shown  in  Figure  1.  Also,  with  the  mathematical  rigor  attainable  in  such  an  electro- 


GROUND  PLANE 


Fig.  1.  Guidance  along  a dielectric  ridge  waveguide. 

magnetic  boundary  value  problem,  many  practical  problems  that  are  not  amenable  to 
a rigorous  analysis  can  be  approximated  by  a structure  with  one  or  more  step  discon- 
tinuities, known  as  the  staircase  approximation.  Such  an  approach  has  been  employed 
in  a study  of  the  radiation  characteristics  of  high  gain  dielectric  taper  antennas  of  the 
type  shown  in  Figure  2. 


Fig.  2.  Dielectric  taper  antenna  --  symmetric  bisection. 

The  staircase  approximation,  involving  a series 
of  dielectric  steps,  is  shown. 

In  the  literature,  the  analysis  of  the  step  discontinuity  has  been  restricted  to  the 
special  case  of  normal  incidence,  where  the  TE  and  TM  surface-wave  modes  exist 
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independently.  On  the  other  hand,  at  an  oblique  incidence  angle,  both  TE  and  TM 
modes  must  be  simultaneously  present  in  order  to  satisfy  the  required  boundary  condi- 
tions at  a step  discontinuity.  In  other  words,  for  either  TE  or  TM  mode  incidence, 
both  modes  are  excited  at  the  step  discontinuity.  Such  mode  coupling  may  result  in 

interesting  physical  phenomena,  such  as  a novel  resonance  effect  and  leakage  of  energy 

4 

from  a dielectric  waveguide. 

The  difficulty  encountered  in  solving  the  class  of  dielectric  waveguides  which  are 
open  structures  is  in  the  handling  of  the  continuous  spectrum  that  accounts  for  the  radi- 
ation and  storage  of  electromagnetic  energy  in  the  presence  of  discontinuities.  In  the 
special  case  of  normal  incidence  on  the  step  junction,  a rigorous  treatment  of  the  con- 
tinuous spectrum  requires  the  solution  to  an  integral  equation  which  is  not  amenable 
to  an  exact  analysis.^  In  the  general  case  of  oblique  incidence,  it  is  expected  that  an 
exact  analysis  will  result  in  a system  of  integral  equations  for  which  the  construction 
of  an  accurate  solution  is  even  harder.  Instead  of  approximating  the  governing  equations, 
we  enclose  the  dielectric  stripline  in  a metallic  waveguide  so  that  the  mode  functions 
for  the  representation  of  the  electromagnetic  fields  become  discrete.  In  such  a closed 
system,  the  mode  amplitudes  are  determined  from  a system  of  linear  equations  from 
which  the  numerical  results  can  be  obtained  with  known  accuracy.  Therefore,  the 
novel  wave  phenomena  previously  predicted  can  now  be  analyzed  accurately. 

A step  discontinuity  between  two  dielectric  waveguides  is  shown  in  Figure  3.  The 


''PERFECT  ELECTRIC  CONDUCTOR 
(GROUND  PLANE) 

(0)  TOP  VIEW  (b)  END  VIEW 

Fig.  3.  Scattering  of  a surface  wave  by  a dielectric  step  discontinuity. 
The  top  view  in  (a)  shows  that  incidence  at  an  oblique  angle 
produces  coupling  between  TE  and  TM  modes.  The  structure 
shown  in  end  view  in  (b)  includes  the  fictitious  top  perfectly- 
conducting  plate  introduced  to  aid  in  the  calculations  by  dis- 
cretizing the  continuous  spectrum. 


— - v*,' 
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discontinuity  may  be  due  to  a change  in  the  physical  dimensions  (tj  / t^)  or  the  material 
property  (t  j £ i or  both.  Here,  we  assume  that  the  structure  is  uniform  and  infinite 
in  extent  along  the  z-direction.  For  most  millimeter  wave  applications,  the  dielectric 
waveguides  are  expected  to  be  on  a ground  plane,  as  shown  in  Figure  3.  Such  a struc- 
ture may  also  be  considered  as  the  short-circuit  bisection  of  a larger  structure  that  is 
symmetric  in  y.  The  upper  metallic  plate  at  the  height  y = h is  introduced  artificially 
in  order  to  discretize  the  mode  functions  so  that  the  ensuing  analysis  can  be  carried 
out  exactly.  The  mode  functions  in  a partially  filled  parallel  plate  waveguide  are  well- 
known;  they  consist  of  surface-wave  modes  that  are  evanescent  in  the  air  region  and 
non- surface- wave  modes  that  vary  sinusoidally  in  the  transverse  (y)  direction.  Since 
the  incident  wave  is  expected  to  be  a surface- wave  mode  which  is  evanescent  in  the  air 
region,  the  effect  of  the  upper  metallic  plate  at  a sufficiently  large  height  should  be 
exponentially  small.  For  the  original  open  structure  (without  the  upper  metallic  plate), 
the  energy  radiated  away  from  the  discontinuity  can  now  be  accounted  for  by  the  propa- 
gating non- surface- wave  modes  of  the  closed  waveguide  and  the  energy  stored  in  the 
vicinity  of  the  discontinuity  can  be  accounted  for  by  those  modes  that  are  below  cutoff 
in  the  x-direction  in  the  metallic  waveguide.  Therefore,  in  this  approach,  it  is  neces- 
sary to  evaluate  the  amplitudes  of  the  propagating  as  well  as  the  evanescent  modes. 

It  is  shown  in  Fig.  3(a)  that  a surface  wave  mode  is  incident  obliquely  at  an  angle 
6 onto  a step  discontinuity  between  two  uniform  dielectric  waveguides.  In  contrast  to 
the  special  case  of  normal  incidence  (0  =0),  the  general  case  of  oblique  incidence  is  a 
vector  boundary  value  problem  that  requires  the  simultaneous  presence  of  both  "TE" 
and  "TM"  modes  (w.r.t.  their  directions  of  propagation).  With  respect  to  the  structure 
coordinate  (x,  y and  z)  system,  as  specified  in  Fig.  3,  the  field  components  of  the  "TE" 
and  "TM"  surface-wave  modes  are  given  in  Table  I.  The  notation  used  in  the  table  is: 

A single  prime  denotes  a quantity  for  the  TE  modes  and  a double  prime  for  the  TM 
modes,  (i  is  a transverse  mode  function  which  is  assumed  known.  V and  I are  the 
transmission  line  voltage  and  current,  respectively,  Zg  is  the  surface  wave  impedance, 
and  and  are  the  impedances  with  respect  to  the  x and  z directions.  In  order  to 
match  the  boundary  conditions  at  the  step  discontinuity  at  x = 0,  the  z-component  of  the 
propagation  vectors  of  the  scattered  waves  must  be  equal  to  that  of  the  incident  wave; 
the  exponential  function  expf-jk^z)  accounting  for  the  phase  change  in  the  z direction  is 
common  to  every  field  component  of  any  waveguide  mode  and  is  therefore  suppressed 
in  the  table  for  simplicity.  The  continuity  of  the  four  tangential  field  components  (E^, 

E , H and  H^)  results  in  the  following  four  systems  of  linear  equations  that  can  be 
written  in  the  matrix  form  as: 
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TABLE  I.  Field  components  and  their  representations 


TE  mode 

TM  mode 

E;  = Z'z  I'(x)  4>'(y) 

EL  -■"<-»[  j 

E'  = 0 

y 

E;  = V-(x)*«(y)^ 

E-  = - v'(x)  <j>'(y) 

E"  = Y n V "(x)  f t ^-r  y-  4>”(y)l 

z z l jweoe(y)  dy  'X'J 

Hi  = V'<*>[j^  Ty  ♦W] 

H"  = Y » V"(x)  4>"(y) 

Hy  = i'(x)  <t>'(y) 

H"  = 0 

y 

= I"(x)  4>»(y) 

|jV(,).-ikiZiI,|,) 

y-  I'(x)  = - j k ’ Y'  V'(x) 
dx  ' ' J x x ' 

~ V "(x)  = -jk»  ZM"(x) 

^ I"(x)  =-jk^Y»V(x) 

Z’  = l/Y'  = Z sin  0' 

X X s 

Z'  = l/Y'  = Z cos  0' 
z'z  s 

z;  = «h0/k' 

Y"  = l/Z"  = Y"  sin  6" 

X X s 

Y"  = l/Z"  = Y"  cos  0" 

z z s 

Y"  = UE  /k" 
s o 

_I’(0)  = P'JJ(O)  (1) 

V"(0)  = P"  V»(0)  (2) 

R'_L'(0)  +_L"(0)  = S'T'(O)  +Q"T"(0)  (3) 

V'(0)  + R"  V"(0)  = Q'  V'(0)  + S"  V"(0)  (4) 

where_I_'(0)  and  V/(0)  are  infinite  column  vectors  consisting  of  the  TE  modal  current  and 
voltage  of  the  waveguide  on  the  left  and  evaluated  at  x = 0,  andj_"(0)  and  V"(0)  are  those 
for  the  TM  modes.  The  quantities  with  a superbar  are  those  for  the  waveguide  on  the 
left.  The  coefficient  matrices  P,  Q,  R and  S with  either  a single  prime  or  a double 
prime  are  related  to  the  scalar  products  of  the  known  mode  functions.  Evidently,  Eqs. 
(3)  and  (4)  include  the  effect  of  coupling  between  TE  and  TM  modes.  The  analysis  above 
is  exact  for  the  structure  under  consideration. 
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We  intend  to  utilize  the  analysis  discussed  above  to  obtain  numerical  values  for 
the  scattering  characteristics  of  a variety  of  different  dielectric  step  discontinuities, 
particularly  those  which  arise  naturally  as  the  sides  of  waveguiding  structures.  How- 
ever, we  have  already  obtained  approximate  numerical  values  for  structures  corre- 
sponding to  some  waveguides  used  in  integrated  optics  and  millimeter  waves,  under 
the  condition  that  the  continuous  spectrum  is  neglected.  Thus,  only  the  surface  wave 
modes  are  included  in  the  solution,  and  the  problem  simplifies  considerably.  Those 
solutions  will  not  yield  information  on  the  radiation  pattern,  but  they  will  tell  us  what 
major  effects  are  produced  by  the  TE- TM  mode  coupling  introduced  by  the  oblique 
angle  incidence.  Later,  using  the  complete  solution  above,  we  can  determine  the  added 
influence  of  the  continuous  spectrum. 

One  of  the  step  junctions  analyzed  is  the  side  of  the  inverted  strip  waveguide  for 
millimeter  waves,  a structure  proposed  by  Itoh.  As  an  example  of  those  numerical 
calculations,  which  were  obtained  for  a variety  of  parameters  values,  we  present  in 
Fig.  4 the  scattering  characteristics  obtained  for  TE  mode  incidence,  since  they  illus- 


Fig.  4.  Relative  amplitudes  of  the  reflected  and  transmitted  surface  wave 

modes  when  a TE  surface  wave  mode  is  incident  at  an  onlique  angle 
0 on  the  dielectric  step  junction  shown  (corresponding  to  a side  of 
the  inverted  strip  waveguide  for  mm  waves).  Note  the  presence 
of  a null  corresponding  to  a Brewster  angle. 
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trate  an  interesting  physical  effect.  The  physical  dimensions  of  the  structure  are  shown 
in  the  insert,  and  are  chosen  such  that  each  of  the  two  uniform  waveguiding  regions 
supports  only  the  fundamental  TE  and  TM  modes  at  the  frequency  of  30  GHz.  For  the 
fundamental  TE  surface  wave  incident  from  the  right  of  the  discontinuity,  the  relative 
amplitudes  of  the  scattered  fundamental  modes  are  shown  as  a function  of  the  incident 
angle  0.  From  Fig.  4,  we  observe  that  there  exists  a null  (near  9 =50°)  in  the  specular 
reflection  coefficient,  as  the  incident  angle  is  scanned.  No  such  null  reflection  is  ob- 
served for  the  other  polarization.  Such  a null  reflection  for  "TE"  surface  wave  incidence 
may  be  interpreted  as  the  analogue  of  the  Brewster  angle  effect  which  is  known  to 
exist  in  the  case  of  TM  plane  wave  scattering  by  an  interface  between  two  dielectric 
media.  Possible  practical  implications  of  the  Brewster  angle  effect  on  waveguiding 
characteristics  are  still  being  investigated. 
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NEW  PROPAGATION  EFFECTS  FOR  THE  INVERTED  STRIP  DIELECTRIC 
WAVEGUIDE  FOR  MILLIMETER  WAVES 

A.  A.  Oliner,  S.  T.  Peng  and  J.  P.  Hsu 

Associated  with  the  resurgence  in  interest  during  the  past  few  years  in  millimeter 
waves,  several  novel  waveguiding  structures  have  been  proposed  which  are  suitable 
for  millimeter  wave  integrated  circuits  but  which  avoid  the  high  loss  associated  with 
the  metallic  strip  of  microstrip  line  at  these  high  frequencies.  One  of  the  most 
promising  of  these  is  the  inverted  strip  dielectric  waveguide,  '*  2 the  cross  section  of 
which  is  shown  in  Figure  1.  This  waveguide,  which  was  proposed  by  Itoh,  has  the 
added  virtue  that  the  loss  due  to  the  metallic  ground  plane  is  also  minimized,  since 
the  dielectric  constant  values  are  so  chosen  that  most  of  the  energy  resides  in  the 
upper  dielectric  sheet.  The  dielectric  strip  serves  to  confine  the  guided  wave  laterally, 
but  the  field  in  the  strip  region  is  evanescent  vertically,  with  the  result  that  the  field 
becomes  small  when  it  finally  reaches  the  metallic  ground  plane,  and  the  ground  plane 
current  remains  low  as  a consequence. 


X 


guiding  layer  («2) 
dielectric  strip  (*,) 


T 


ground  plane 


Fig.  1 Cross  section  of  the  inverted 
strip  dielectric  waveguide 
for  millimeter  waves. 

The  published  theoretical  analysis*  for  the  propagation  characteristics  of  this 
waveguide  type  is  approximate,  but  it  serves  well  enough  to  produce  reasonable  agree- 
ment with  available  experimental  results.  * However,  when  a more  careful  examination 
is  made  of  the  comparison  between  theory  and  measurements,  it  is  seen  that  the  agree- 
ment is  poor  in  some  regions  and  that,  in  fact,  some  puzzling  discrepancies  appear. 

We  have  conducted  an  improved  analysis  of  the  behavior  of  the  structure,  and  we  have 
shown  that  this  improved  theory  not  only  produces  better  agreement  with  measurement 
but  also  reveals  a number  of  very  interesting  and  fundamental  propagation  effects 
which  the  earlier  approximate  analysis  misses  entirely. 

The  approximate  theoretical  analysis'  mentioned  above  utilizes  the  so-called 
"equivalent  dielectric  constant"  (edc)  method,  which  is  essentially  a simplified  trans- 
verse resonance  procedure  that  assumes  the  presence  of  only  one  mode,  the  lowest  TM 
mode,  in  each  of  the  constituent  transverse  regions  comprising  the  guide  cross  section. 
Furthermore,  in  the  transverse  resonance  taken  under  these  conditions  in  the  horizontal 
direction,  the  geometrical  discontinuities  present  at  the  strip  sides  are  completely 
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neglected  since  each  constituent  region  is  viewed  as  possessing  an  "equivalent 
dielectric  constant.  " 

The  first  thing  that  a more  accurate  analysis  reveals  is  that  in  each  constituent 
region  in  the  guide's  cross  section  at  least  two  propagating  modes,  one  TM  and  the 
other  TE,  must  be  present  simultaneously  under  almost  all  conditions.  In  fact,  if  the 
dielectric  sheet  or  dielectric  strip  thickness  were  increased,  then  the  number  of 
modes  present  simultaneously  would  increase  to  four  or  to  six,  etc.  When  we  examine 
the  three  structures  actually  analyzed  and  measured  by  Itoh,  1 we  find  that  four  simul- 
taneous modes  are  present  in  two  of  them,  and  that  six  simultaneous  modes  are 
present  in  part  of  the  third  guiding  structure. 

The  second  key  point  that  a more  accurate  analysis  indicates  is  that  the  TE  and 
TM  modes  which  are  present  necessarily  couple  to  each  other  at  the  geometrical  dis- 
continuities corresponding  to  the  sides  of  the  strip.  This  coupling  requirement  is 
readily  proven  by  an  examination  of  the  field  components  that  must  be  present  at 
these  strip  sides. 

The  simultaneous  presence  of  these  other  modes  in  the  constituent  regions,  and 
their  coupling  at  the  strip  sides,,  produce  a number  of  interesting  and  sometimes  im- 
portant physical  effects  now  observed  earlier.  Among  these  effects  are  the  following: 

(a)  Under  appropriate  conditions,  leakage  from  the  strip  sides  can  occur, 
changing  the  dominant  bound  mode  into  a leaky  mode. 

(b)  Under  appropriate  conditions,  for  certain  frequencies  or  strip  widths, 
resonance  effects  can  occur. 

(c)  The  transverse  field  behavior,  both  vertically  and  horizontally,  can  differ 
significantly  from  what  one  expects  on  the  basis  of  the  dominant  mode 
alone. 

(d)  If  discrete  modes  higher  than  the  lowest  TE  and  TM  ones  are  present  on 
the  line,  they  will  almost  always  be  leaky. 

These  previously  unexpected  propagation  effects  could  produce  unwanted  per- 
formance difficulties  under  some  circumstances,  or  else  advantage  could  be  taken  of 
them  if  it  is  known  in  advance  what  the  effects  are  and  when  they  occur.  Furthermore, 
by  a proper  choice  of  dimensions  or  dielectric  constants  these  effects  can  either  be 
made  to  appear  or  be  eliminated. 

We  present  in  Figure  2 calculations  for  the  basic  surface  wave  characteristics  of 
the  constituent  regions  comprising  the  inverted  strip  guide  cross  section.  The  calcu- 
lations are  in  the  form  of  the  "effective  dielectric  constant,  "e  which  is  proportional 
to  the  propagation  constant,  as  a function  of  the  dimensions  for  the  specific  parameters 
chosen  by  Itoh1  for  his  three  waveguides.  (Actually,  the  plots  are  given  as 
tteCf  = ( € ef f)  vs.  h»  which  is  the  height  of  the  strip,  when  d,  the  height  of  the  upper 
layer,  f,  the  frequency,  and  the  dielectric  constant  values  in  each  region  are  specified.) 
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The  two  curves  labeled  =1.0  correspond  to  the  constituent  portions  of  the  guide 
cross  section  which  are  outside  of  the  strip  portion,  whereas  the  two  curves  labeled 
t ^ = 2.  1 hold  for  the  strip  region  itself.  The  three  cases  examined  by  Itoh  cor- 
respond to  h = 0.  794  mm,  1.  588  mm  and  3.  175  mm.  Both  inside  and  outside  of  the 
strip  region,  at  least  4 modes  are  present  simultaneously,  2 TE  modes  and  2 TM 
modes.  In  his  analysis,  Itoh  assumed  that  only  the  lowest  TM  mode  was  present. 

Several  other  interesting  features  al3o  follow  from  Figure  2.  Note  that  the 
lowest  TM  mode,  which  is  the  mode  incident  and  the  one  which  contains  most  of  the 
power,  is  no  longer  the  "dominant"  mode  in  the  sense  that  it  is  the  slowest  mode,  as 
it  is  for  a single  dielectric  layer  on  a ground  plane.  As  h increases,  a cross-over 
occurs,  and  the  TE  mode  becomes  the  "dominant"  one.  It  is  the  occurrence  of  this 
interesting  cross-over  that  makes  possible  the  leakage  and  the  resonance  effects. 


Fig.  2.  Basic  surface  wave  characteristics  for  the  con- 
stituent regions  cohiprising  the  guide  cross  section. 
"Effective  dielectric  constant"  values  as  a function 
of  strip  height  for  all  of  the  discrete  modes  existing 
at  that  frequency  (81.7  GHz). 
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The  curves  in  Figure  2 are  used  in  a simple  way  to  determine  when  these 
leakage  and  resonance  effects  will  appear.  The  procedure  follows  the  one  developed 
for  a similar  purpose  in  the  context  of  integrated  optics.  3 As  mentioned  earlier,  the 
leakage  and  resonance  effects  arise  because  of  the  TE-TM  coupling  which  occurs  at 
the  strip  sides.  The  TM  wave  inside  the  strip  region,  which  is  incident  at  an  angle 
on  the  strip  side,  excites  a reflected  propagating  TM  wave  inside,  an  evanescent  TM 
wave  outside,  and  (due  to  the  coupling)  TE  waves  both  inside  and  outside  the  strip 
region.  If  the  TE  wave  excited  outside  is  propagating  rather  than  evanescent,  it 
carries  away  a small  amount  of  energy  at  each  reflection,  resulting  in  a leaky  mode 
rather  than  a purely  bound  mode.  If  the  TE  wave  inside  is  excited  at  an  appropriate 
angle,  then  resonance  effects  can  occur  at  certain  strip  widths.  (Similar  leakage 
and  resonance  behaviors  have  been  reported  for  strip  waveguides  for  integrated 
optics,  but  certain  differences  are  present  in  this  millimeter  wave  structure.) 

One  or  the  other  of  thesu  special  effects  can  be  present  (or  absent)  depending  on 
the  precise  dimensions  of  the  structure.  Table  I summarizes  the  resonance  and 
leakage  behavior  for  three  sets  of  dimensions  for  the  inverted  strip  waveguide,  using 
the  dielectric  constant  values  employed  by  Itoh. 

TABLE  I 

Basic  mode:  lowest  TM 
= 2.1,  t ^ = 3.8 


For  case  1,  we  see  that  leakage  can  occur  without  resonance  effects.  However, 
the  leakage  may  be  eliminated  if  the  strip  width  is  increased  beyond  about  0.  3 \ , 
which  is  not  difficult  to  accomplish.  On  the  other  hand,  in  case  2 we  observe  both 
resonance  and  leakage,  where  the  leakage  would  be  present  over  a rather  wide  range 
of  values  of  W/\  . In  case  3,  neither  effect  is  present,  showing  that  it  is  certainly 
possible  to  design  the  waveguide  without  encountering  these  problems. 
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INTERACTIVE  MECHANISMS  AND  EFFECTS  OF  LOW-LEVEL  MILLIMETER  WAVES 
ON  LIVING  SYSTEMS 

S.  Motzkin,  S.  W.  Rosenthal,  L,  Birenbaum,  R.  Melnick,  C.  Rubenstein,  R. 

Remilly  and  S.  Davidow 

Preliminary  results  of  an  experimental  study  of  the  effects  of  single  CW  mm  - 
wave  exposure  on  the  physiological  and  metabolic  processes  of  prokaryotic  and 
eukaryotic  cells  are  reported. 

A millimeter  wave  exposure  system  has  been  developed  to  be  used  over  a 
frequency  range  of  26.  5-7  5GHz  and  which  can  further  be  extended  to  110  GHz.  The 
system  consists  of  three  separate  waveguide  sizes,  RG-96/U  or  WR-28  (Figs.  1,2) 

(26.  5-40  GHz,  11.  3 - 7.  5 mm),  RG  - 97/U  or  WR  - 22  (Figs.  3,  4)  (33-50  GHz, 

9.  1-6.  0mm),  and  RG-98/U  or  WR  - 15,  (50-75  GHz,  6.0-4  mm).  The  above  systems 
terminate  in  a horn,  the  aperture  of  which  is  placed  about  2 mm  above  the  experi- 
mental material  to  be  exposed.  Biological  samples  are  either  condensed  on  a filter 
paper  surface  or  in  solution  in  a wate r- jacketed,  temperature  controlled  plexiglass 
cell  (Figures  5,6).  Standard  millimeter  waveguide  components  are  used  for 
dosimetry  and  reproducibility  in  this  system  and  thus  insure  a known  exposure  of 
the  cellular  material  (Figures  7,  8). 


Cells  and  subcellular  components  which  are  currently  under  investigation 
include  E.  Coli  (strains  W3110  with  Col  E^  factor  and  W1485,  rat  liver  mitochondria 
and  red  blood  cells.  Earlier  work,  carried  out  by  scientists  in  the  Soviet 
Union^  ^ ^ near  50  GHz  at  low  power  levels  revealed  specific  frequency 

dependent  effects  which  are  cyclical  and  suggest  resonance  phenomena. 


A. E.Coli  Experiments 

Colicin,  an  antibiotic,  is  synthesized  in  bacterial  strains  which  have  a plasmid 
that  contains  colicinogenic  factor.  Although  present.  Col  E^  cells  normally  do  not 
actively  produce  large  quantities  of  colicin.  However,  if  these  cells  are  incubated 
above  43°  C,  treated  with  mitomycin  C,  or  exposed  to  ultraviolet  light,  then  active 
colicin  synthesis  and  replication  of  DNA  molecules  which  code  for  Col  E^  occurs. 
Russian  scientists  have  reported  that  millimeter  waves  can  stimulate  increased 
colicin  production  in  a frequency  dependent  manner,  which  is  almost  insensitive  to 
power  densities  over  2 orders  of  magnitude.  ^ Attempts  to  reproduce  these  experi- 
ments are  in  progress. 

Col  Ej  cells  in  log  and  in  stationary  phase  are  to  be  irradiated  at  intervals  in 
the  51.  3-52.  2 GHz  frequency  range  (5.  85  to  5.  75  mm)  for  30  to  60  minutes,  at 
incident  power  densities  of  approximately  .001  mW/cm^  at  25°,  37°,  and  43°C. 
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Col  Ej  cells  are  irradiated  in  a water- jacketed,  temperature  controlled  cell 
(Figure  5).  Exposed  Col  E^  cells  are  grown,  concentrated  and  thinly  spread  over 
a nutrient  agar  plate  to  achieve  a homogeneous  lawn.  These  cells  are  chloroformed 
and  Col  sensitive  bacteria  are  then  inoculated  onto  the  plate  (Figure  9).  Clear  areas 
or  plaques  (Fig.  10),  which  develop  during  the  subsequent  growth  period  are  scored. 

These  areas  are  indicative  of  the  destruction  of  Col  sensitive  bacteria.  Results  in 
irradiated  specimens  are  compared  with  controls  of  Col  E^  cells  grown  alone,  and 
of  uninduced  Col  E^  and  sensitive  bacteria  grown  together  (Figure  11).  How  the 
amount  of  colicin  is  related  to  the  plaque  size  has  yet  to  be  determined. 

Preliminary  observations  indicate  that  colicin  production  is  enhanced  at 
specific  frequencies  at  25°  and  37°C.  Earlier  investigators  have  reported  that 
temperature  above  43°  induces  colicin  production.  Below  this  threshold,  increases  induce 
colicin  production  only  at  specific  frequencies.  Even  in  unirradiated  controls  some 
colicin  production  is  evident.  In  experiments  reported  here,  colicin  production  is 
doubled  in  60  minutes  at  37°C  and  a power  density  of  , 5mW/cm^  following  irradiation. 

B.  Experiments  with  Mitochondria 

Although  the  mechanism  by  which  weak  electromagnetic  fields  act  on  membranes 

is  unknown,  several  suggestions  have  been  advanced  including:  modification  of 

charged  binding  sites  by  conformational  changes  or  displacement  of  surface  bound 

ions,  transformations  of  the  hydrated  phase  of  globular  proteins'^  (which  are  strongly 

dependent  on  divalent  cations  and  which  could  in  turn  propagate  or  amplify  local 

electrical  events),  changes  in  membrane  potential,  ^and  stimulation  of  coherent 

7 8 

molecular  oscillations.  ’ Mitochondrial  membranes  are  being  utilized  in  these 
experiments  because  they  are  well  characterized.  Exposing  them  to  millimeter 
waves  will  permit  us  to  define  possible  alterations  in  their  macromolecular  interactions 
and  to  determine  the  significance  of  these  variations  on  the  functional  capacity  of 
living  cells. 

These  organelles,  extracted  from  rat  liver  cells,  and  centrifuged  to  a paste- 
like consistency  with  a concentration  of  70  mg/ml  protein,  are  placed  in  a water- 
jacketed  accuvette  (Fig.  6)  for  exposure.  Irradiations  carried  out  at  35  GHz  (8.  6 mm), 
in  preliminary  studies, included  relatively  high  power  densities.  Studies  now  in 

progress  will  include  53.  7-50.  5 GHz  (5.  59-5.  94  mm)  as  well  as  varying  power 

2 

densities  from  0.01-1.0  mW/cm  for  12-15  minutes.  Metabolic  membrane  activity 
is  evluated  by  measuring  total  ATP  synthesis  as  a function  of  oxidative  phosphorylation. 

A technique  has  been  devised  which  will  make  it  possible  to  irradiate  mitochondria  in 
a physiologically  active  state  without  the  interference  of  electrodes  imposed  in  the 


- 
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electromagnetic  field.  During  the  latter  procedure,  the  concentration  of  experimental 
material  is  reduced  to  1.  5 mg/ml  to  increase  the  duration  of  experimental  time 
available  before  essential  reactants  (e.g.  oxygen)  are  used  up.  ATP  synthesized  is 
trapped  as  glucose- 6-phosphate  using  glucose  and  hexokinase  to  prevent  ATP  hydrolysis. 
Subsequently,  glucose-6-phosphate  concentration  is  determined  by  evaluating  NADP 
reduction  with  glucose-6-phosphate  dehydrogenase  at  340  nm.  Membrane  integrity  is 
evaluated  as  a function  of  oxidative  phosphorylation  which  is  a complex  process  in- 
volving several  components  in  the  electron  transport  system  and  the  ATPase  complex. 
Alterations  in  the  membrane  which  affect  one  component  will  affect  the  entire  process. 

To  date,  drastic  changes  in  respiratory  control  have  not  been  observed  by  us 
except  with  35  GHz  at  very  high  power  densities  where  irreversible  membrane  injury 
has  certainly  occurred.  Preliminary  results  show  no  effects  at  35GHz,  at  power 
densities  of  10  mW/cm  or  less.  In  experiments  where  effects  have  not  been  observed, 
radiation  injury  incurred  couid  possibly  have  been  repaired.  Therefore,  irradiations 
and  analyses  will  be  carried  out  on  actively  phosphorylating  mitochondira  so  that 
evaluations  can  be  accomplished  prior  to  repair. 


C.  Experiments  with  Red  Blood  Cells 


To  further  elucidate  the  effects  of  irradiation  on  cellular  membranes,  preliminary 


studies  of  red  blood  cells  have  been  undertaken.  It  has  previously  been  reported 
that  5-10  mW/cm  change  the  permeability  of  blood  cell  membranes  to  hemo- 
globin. One  ml  of  packed  red  blood  cells  in  a water-jacketed,  temperature-controlled 


cell,  is  being  irradiated  at  51.7,  GHz  (5.8  mm)  for  15,  30,  60,  120  minutes  at 
7 mW/cm  and  will  be  irradiated  at  1 and  0.  1 mW/cm2  power  density  levels.  Changes 


in  the  permeability  and  porosity  of  the  membranes  are  being  investigated  by  examining 
for  potassium  ion  and  hemoglobin  leakage.  Exposed  cells  are  centrifuged  and  the 
supernatant  fluid  examined  spectrophotometrically  at  576  nm  for  hemoglobin  absorp- 
tion and  with  a potassium  electrode  to  determine  the  concentrations  of  these  components 
liberated  from  exposed  cells.  Results  will  be  compared  with  those  of  controls  handled 
in  the  same  way  but  not  irradiated. 
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Chamber  I Chamber  II  Dimensions 
(inches)  (inches) 


2.50  4.50  a 

1.75  2.625  b 

0.75  2.50  c 

4.25  5.125  d 


NOTES: 

1.  The  above  is  a prototype  of  plexi- 
glass chambers  used  in  irradia- 
tions. Fabrication  includes  one 
with  small  Chamber  l for  red  blood 
cells  and  a larger  Chamber  II 

to  accommodate  perti  dish  with 
E Coli. 

2.  Measurements  of  various  dimen- 
sions for  the  two  chambers  are 
noted  in  inches . 


Fig.  5.  Water-jacketed  sample  chamber. 
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REFLECTED  INCIDENT 

POWER  MONITOR  POWER  MONITOR 


Eig.  7.  Block  diagram  of  35  GHz  set  up. 


CONTROLLED 


Fig.  8.  Block  diagram  of  BWO  set  up. 


Dimensions  of  area  under  square  = cm  ; 
Light  areas  - clear  areas  where  colicino 
genic  bacteria  have  destroyed  indicator 
bacteria;  Large  areas  = .8mm;  Small 
areas  - .2  - ,5  areas. 
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ACOUSTOELECTRIC  REAL  TIME  CORRELATOR 
L.  Rosenheck,  H.  Schachter  and  W.  C.  Wang 

A.  Introduction 

It  has  been  demonstrated  experimentally  that  surface  acoustic  wave  correlation 
of  two  signals  can  be  performed  by  using  space  charge  nonlinearity  in  a structure  con- 
sisting of  a thin  semiconductor  layer  inserted  between  two  different  piezoelectric  sub- 
strates; LiNbOj- Si-BGO.  The  two  surface  waves  are  collinear  and  propagate  in  the 
same  direction.  Both  the  theoretical  analysis  and  computer  results  are  presented 
here.  In  the  analysis,  1)  the  diffusion  effect  is  included  and  2)  neither  thin  film  nor 

thick  semiconductor  layer  approximation  is  used,  since  d<  X , d > X where  d,  X 

3-  JJ  3 

and  XD  are  the  respective  semiconductor  thickness,  acoustic  wave  length  and  Debye 
length.  The  charge  distribution  and  the  nonlinear  interaction  strength  as  a function  of 
semiconductor  thickness  (thin  film  case  included)  and  resistivity  will  also  be  discussed. 

In  this  paper  we  study  the  optimum  acoustoelectric  voltage  across  the  structure 
shown  in  Fig.  1.  This  structure  consists  of  two  semi-infinite  piezoelectric  substrates, 

u>.  = cj.  -(»>_  Correlation 

OLD  _ 

Output 


Fig.  I.  Structure  of  acoustoelectric  real  time  correlator. 


one  of  LiNbOj  [01 0]  cut  [001]  propagating  and  the  other  of  Bi12GeO20  [001]  cut  [110] 
propagating.  Between  the  two  substrates  is  a thin  Si  plate  which  is  sandwiched  in  such 
a way  that  there  is  an  air  gap  of  distance  hf  between  the  Bi^GeO^  and  the  Si  and  one 
of  h2  between  the  LiNbO^  and  the  semiconductor. 

The  role  of  the  semiconductor  is  to  couple  two  collinear  surface  waves  which 

propagate  in  the  same  direction,  one  on  the  BGO  and  the  other  on  the  LiNbO,  surface 

12  3 

via  space  charge  nonlinearity.  ’ This  gives  use  to  an  open  circuit  voltage  at  a frequency 
equal  to  the  difference  between  frequency  of  the  acoustic  wave  traveling  on  the  LiNbO^, 
u>2>  and  the  BGO  traveling  wave  w^. 
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B.  Theoretical  Consideration 

1.  Fields  and  Current  in  the  Semiconductor 

We  will  assume  an  input  signal  f(t)  = 2Acos(oj.  t)  at  the  LiNbO,  transducer  and 

J-1  3 

g(t)  = 2B  cos(wgt)  at  the  BGO  transducer.  The  two  input  signals  will  excite  acoustic 
traveling  waves  in  the  LiNb03  and  the  BGO  with  velocities  of  v and  v0l  respectively. 
The  accompanying  electric  fields  will  give  rise,  in  a first  order,  to  electric  fields 
and  charges  in  the  silicon  traveling  with  the  same  velocity  as  the  electric  fields  in  the 
LiNbO^  and  BGO. 

As  depicted  in  Fig.  2 it  is  convenient  for  one  to  treat  this  layered  structure  as 
two  separate  problems  and  at  a later  time  consider  the  nonlinear  interaction.  Following 
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Fig.  2.  Analytical  geometry  for  BGO-Si-LiNbO^  layered  correlator. 

a method  similar  to  that  of  Lakin  and  Shaw3  we  first  consider  the  case  where  LiNbO^ 
is  the  source  and  BGO  is  the  load.  This  gives  use  to  a differential  equation  within  the 
semiconductor 


♦iV-ki<i  + a2>£+kla2*s=0 

The  solution  of  which  is  the  potential 

o aikLy  •>(“Lt'kLz) 
♦s(t,y,*)  = l A e L e L L +c.c. 

i=  1 


(1) 


(2) 


Similarly  for  LiNbO^  as  the  load  and  BGO  as  the  source  we  get  a differential  equation 
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*;IV-4(i+02)  *f  + 4e2  < = 0 

the  solution  of  which  is 


<(t,y.*)  = V B.e  e B “ +C.C. 

8 J 


where 


, a + ju  € 

± 1 0,  ,=  t 1 aZ=  1 + hj- 


C Dk, 
s L 


a3.4=  **  e3.4=±  p e =1  + 


0 + j“Bes 


CsDkB 


The  A^,^  and  Bjlg  coefficients  are  determined  by  using  the  configuration  shown 
in  Fig.  2 and  following  boundary  conditions:  Jy(0)  = 0,  Jy(-d)  = 0 the  potential  and  dis- 
placement field  are  continuous  at  0 and  -d.  We  also  note  that  <p  (y=  h,)  and  <pi(y'=  h ) 

c B'7  1 ' 

represent  the  source  potential  proportional  to  the  acoustic  input  power.  Associated 
with  the  potential  ^ and  ty'  one  can  easily  find  the  corresponding  electric  field  and 
charges  in  the  semiconductor 

* ®iki  y 

EL"  Z Aie  e (jkL*  - O-k^y)  + c.c.  (5) 


i 2 2 aikLy  j("it-kI*) 

* Z k^O  - O.  ) A.e  e + c.  c. 


Similarly, 


-p.k^y  + d)  j(w  t-k^z) 

EB=  Z Bje  e Hkgi  + + C*' 

j=l 

X 2 2 -3;k,(y+d)  j(u>  t-k^z) 

PB  = Cs  Z 4(1-PPV  J « ^ + c.c. 


The  coupling  of  the  electric  fields  due  to  the  UNbOj  SAW  with  the  charges  due 
the  SAW  propagating  on  the  BGO  and  vice  versa  will  give  rise  to  nonlinear  currents  in 
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the  semiconductor.  If  the  frequencies  and  are  chosen  such  that  = kg 
“B  “l 

(or  — = - — ) then  the  nonlinear  current  will  be  stationary  and  independent  of  z.  The 
B Li 

nonlinear  current  Jj^  is  given  by 

JNL  = m*EL  pB  + EB  P]J  + c'c< 
where  |J  is  the  mobility  of  the  semiconductor. 

If  we  substitute  Eqs.  (5)  to  (8)  into  Eq.  (9)  we  find  that 


3 X $ * * * «V  Pi)kY  -0Tkd  iui  t 

Mk' * £ £ A.B.(p.-ai)(aie%l)e  1 J e J e^’d1 


J ... » ” 6 
NL  s 


1=1  j=l 


where  k=  k^=  kg  and  ud  = «L-“g. 

We  can  assume  that  the  potential  at  the  difference  frequency  is  given  by 
ju  t 

*d(y.  0 = tPd(y)  e (H) 

and  from  the  equation  of  current  in  the  semiconductor  in  one  dimension  (because  there 
is  no  variation  with  z) 


J , = OE  . + J...  - D -rr-2 

dy  dy  NL  n 5y 


and  the  continuity  equation 


V-  J , = - 


d ~5t 


.dj  , 

b?- 


J“desCd 


We  finally  get 


dZE 

h (oEy.»d-  e.Dn  jnl)=  0 

the  solution  of  which  is 

4 4 (a.  - p*)ky 

ELy,w  = Eo+  EiSinhY(y+d)  + E2  sinh  Yy  + £ £ K. . e 1 J 


i=l  j=l 


where 
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K = Mk3  AiBj(P--ai)(l^aigt)e'kejd 
lJ  ^ Y2} 


(16) 


and 


y2_  o “d. 

ns  c 


In  order  to  solve  for  Eq.  E,.  and  E2  we  use  the  boundary  conditions 

e E (0)  = c E f-d)  s e r 
s y,wd'  *-3  y,«d(  a'  eoE 


and 


Jyd<°)  = Jyd<-d>  - 0 

where  E is  the  electric  field  at  the  frequency  wd  in  the  adjacent 

E - J“dC°E 

° o + j«des 


medium.  We  obtain 


(17) 


®7EWc  * £ K.. 

1 ■tnhYd(«  +jw  ) ' Z Z HTtffva 

c d 1=1  j=l 


E.  = 


(18) 


E2  smh  Yd(«c  + jw  ) + Z E ■'lJ  sink  Yd 

i=  1 j=  1 

The  open  circuit  voltage  (VQ  c ) is  then  found  by  integrating  E, 
semiconductor  and  retaining  the  nonlinear  part. 

2 an*  'kg.  d # 

v = Hisi  V V AiBie  <1  + Mf) 

o.c  D L L # ? y — 5-*k- 

n i=ij=i  (V^Tk2-/ 


e 

~ Eu 
€s  c 


4 4 K..e'(ai-ej)kd 


y.“ 


(19) 


across  the 


(20) 


r(ej*-ai,k  Yd 

• j — ^ ^ tank  -j- 
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It  is  apparent  that  the  open  circuit  voltage  should  have  an  optimum  at  a particular 
frequency  for  a particular  semiconductor  thickness.  If  the  semiconductor  is  "thick" 
we  would  expect  little  or  no  coupling  between  the  electric  field  and  the  space  charges 
of  the  LiNbOj  and  the  BGO.  Similarly,  if  the  semiconductor  is  thin  the  area  of  inter- 
action is  small. 


If  we  consider  Fig.  3,  we  see  that  indeed  there  is  an  optimum  thickness  d for  a 
particular  frequency.  It  is  also  apparent  that  if  one  was  interested  in  an  optimum 

voltage  regardless  of  the  frequency  or  the  semiconductor  thickness  it  would  only  be 

ll.OOr 

/ \ wL  (RAD/SEC) 

I \ 2ttXI08 

_y  I \ 2ttXI07 

y.?  8-25  / \ 27TXIO®  (multiply  verticil  axil 

is \'t  A I \ scale  by  0.005) 


SEMICONDUCTOR  THICKNESS  VdXI0',J 
Fig.  3.  Correlation  output  as  a function  of  semiconductor 
thickness  (d)  with  frequency  (w^)  as  a parameter. 

necessary  to  vary  the  thickness  and  the  frequency  until  V was  an  absolute  maximum 
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AN  ACOUSTOELECTRIC  FM  DEMODULATOR 
H.  Schachter,  W.  C.  Wang,  F.  Cassara  and  L.  Rosenheck 

Acoustoelectric  signal  processors  such  as  the  convolver  and  correlator  are 
built  based  on  (i)  strong  space  charge  nonlinearity  induced  by  SAWs  in  an  adjacent 

(semiconductor  and  (ii)  the  filtering  action  generated  by  spatial  integration  over  the 

length  of  nonlinear  interaction.  Utilizing  these  special  features  again,  a new  type  of 
FM  demodulator  has  been  developed.  This  type  of  demodulator  is  structurely  simple. 
Si  on  LiNbO^  and  can  be  operated  at  a carrier  frequency  in  excess  of  100  Mhz. 

Figure  1(a)  describes  the  basic  device  geometry.  A frequency  modulated 
signal,  f(t)=  A cos  (»ct+  Aw  J g(r)dr)  is  applied  simultaneously  to  the  pair  of 
interdigital  transducers  T^  and  T ^ The  transducers  are  separated  by  a small 
distance  t . The  applied  FM  signal  will  induce  two  collinear  surface  acoustic  waves 
which  are  propagating  in  the  same  direction  and  with  a time  separation  of  t = ^/v. 
v is  the  SAW  velocity.  When  the  two  waves  propagate  under  semiconductor,  the 
piezoelectric  field  associated  with  the  waves  will  induce  space  charge  waves  inside 

the  semiconductor.  Through  the  nonlinear  interactions  among  the  induced  space 

IS 

charge  waves  and  the  piezoelectric  field  waves,  many  higher  order  signals  are 

generated.  However,  after  integrating  along  the  semiconductor  length  L,  only  the 

term  which  performs  the  FM  demodulation  is  of  significant  amplitude  and  can  be 

detected  across  the  semiconductor  terminals.  For  simplicity  and  clarity,  we  will 

examine  the  case  of  a single  tone  FM  signal;  i.  e.  , g(r)  = cos  u t.  The  sum  of  the 

2 

input  signal  and  its  delay  squared,  { f (t ) + f (t -tQ ) } will  give  rise  to  several  second 
order  nonlinear  mixing  terms.  It  can  be  shown  that  the  only  term  which  is  not 
negligible  and  of  interest  is: 

2f(t)  f(t-t  )=2ABcos|u  t + /3sinu  tjcoslu  (t-t  ) + /3sinu  (t-t  )J 
o'  ■ c m ’ • c'  o H m o • 

= AB  cosjucto  + [2/3sin  '^--jcos^^t  - n^°)}  (1  ) 

(J  t u t 

+ ABcosj2u  t -u  t +[2/3 cos  ri°1sin(u)  t ) { 

•c  col  2J  V m 2 71 

where  A and  B are  associated  with  the  amplitude  of  f (t ) and  f (t -tQ ) respectively. 

B,  u , Au  and  u are  the  respective  modulation  index  (Aw/u  ),  modulation  rate, 
me  m 

peak  frequency  deviation  and  carrier  frequency.  The  open  circuit  voltage  across  the 
semiconductor  is  then  of  the  form 
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where  K ^ is  a constant  in  terms  of  the  2nd  order  nonlinear  coefficient  and  other 

device  parameters  such  as  carrier  density,  mobility  and  etc.  is  the  Bessel 

function  of  nth  integer  order.  Equation  (2)  reveals  that  the  novel  SAW  device  does 

indeed  demodulate  the  received  FM  signal  (n=  +_!  terms).  All  other  terms  are 

negligible  provided  Aw  t <<  1 since  J ( Aw  t )<<J,(Awt  ) for  n>2.  It  is  noted  in  the 

o no  1 O *“  • _ 

above  expression  that  the  action  of  spatial  filtering  comes  in  the  form  of  . As  a 

t 0 3 

numerical  example,  let  us  consider  the  case  — a 3 4 sec,  w a 811  X 10 

rad.  / sec.  and  w a 2 n X 1 0®  rad.  / sec.  The — r — 2-  factor  (0  = nti  L/ 2v,  associated 
c Q n m 

with  J ) approaches  to  i for  n < 2,  and  the  factor1  sin  6 / 0 (associated  with  J ) 
n p p p 

approaches  to  zero  for  any  p.  Therefore,  the  only  term  remaining  in  the  open 

circuit  voltage  expression  is  that  related  with  the  modulation  signal.  Further,  it  is 

noted  that  2/3  sin  (w  t / 2)=  Awt  , since  w t / 2 < < 1 . For  linearity,  Aw  t is 

required  to  be  less  than  one.  For  example,  if  a frequency  deviation,  Af  = 20  Mhz  is 

needed,  one  requires  a delay  to  tQ<  5n  sec.  and  a carrier  frequency  f^  =r  1 03  Mhz.  A 

unique  feature  inherent  in  this  device  is  that  one  can  adjust  w t to  select  a set  of 

co 

desired  order,  (even,  odd  or  both)  of  Bessel  functions  to  operate  with.  Experimental 

results  are  in  agreement  with  the  derived  expression. 

A typical  set  of  experiments  are  described  here  in  which  t is  set  at  0.  34  u sec. 

o 

The  oscilograms  in  Fig.  2 are  taken  at  constant  carrier  frequency,  95  Mhz  at  varying 
deviations.  (At  this  carrier  frequency  only  odd  harmonics  are  observed.  ) The 
observed  disappearances  (first  zeros)  of  fundamental,  third  harmonic  and  fifth 
harmonic  are  corresponding  to  AwtQ=  3.  74,  6.  4 and  8.  8 respectively.  They  are 

compared  with  theoretical  predictions  of  AwtQ=  3.  81,  6.  38  and  8.  77.  The  oscillo- 
grams in  Fig.  3 are  taken  under  conditions  of  constant  frequency  deviation,  but 
varying  carrier  frequencies.  In  oscillogram  3(a)  only  the  odd  (fundamental)  is 
predominant,  in  3(b)  both  even  and  odd  are  present,  in  3(c)  only  the  even  (second)  is 

predominant.  From  3(a)  to  3(c)  the  change  in  w t is  (w  t ) - (w  t ) = 0.  54  n. 

co  c o a c o c 

Figure  4 gives  the  comparison  between  the  shapes  of  the  input  modulation  signal  g(r) 
and  that  of  the  device's  output  demodulated  signal.  Clearly  successful  FM  demodula- 
tion is  achieved  even  for  the  case  of  nonsinusoidal  modulation. 
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Fig.  4.  Top  trace,  input  modulation  signal  g(r) 

Lower  trace,  demodulated  output  FM  signal. 

(the  180°  phase  inversion  between  input  and  output  can  be 
made  zero  degrees  by  interchanging  the  semiconductor 
output  terminals). 
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A SIMPLE  CRITERION  FOR  PREDICTING  LEAKY  WAVES  ON  RIB  WAVEGUIDES  FOR 
INTEGRATED  OPTICS 

S.  T.  Peng  and  A.  A.  Oliner 

Three-dimensional  waveguides  for  integrated  optics  can  be  divided  into  two  broad 
classes:  those  that  consist  of  a dielectric  strip  which  is  deposited  on  a dielectric  sub- 
strate or  else  a channel  which  is  diffused  or  ion- implanted  into  it,  as  shown  in  Fig.  1(a) 
and  1(b),  and  those  which  utilize  an  additional  thin  dielectric  layer  on  top  of  the  sub- 
strate, as  illustrated  in  Figs.  1(c)  through  1(f).  The  waveguides  in  the  first  class  are 


Fig.  1.  Examples  of  three-dimensional  waveguides  for  integrated  optics. 

Waveguides  (a)  and  (b)  fall  into  the  class  for  which  no  leakage  is 
possible;  the  rest  are  members  of  the  class  for  which  some  modes 
may  leak. 

essentially  modifications  of  a dielectric  rod  waveguide;  if  the  strip  were  absent,  no 
surface  wave  could  be  guided  at  all.  For  the  waveguides  in  the  second  class,  the  ad- 
ditional thin  layer  would  guide  a wide  surface  wave  beam  in  the  absence  of  the  guiding 
strip  region.  The  two  classes  thus  differ  from  each  other  because  of  the  added  thin 
dielectric  layer,  or  film.  The  significance  of  this  distinction  in  connection  with  this 
report  is  that  all  modes  propagating  along  waveguides  in  the  first  class  are  purely 
guided,  with  no  leakage  possible  (assuming  that  the  waveguides  are  uniform  --  leakage 
can  occur  if  the  waveguiding  structure  is  tapered  or  periodically-modulated  along  its 
length).  On  the  other  hand,  as  we  shall  see  below,  some  modes  on  waveguides  of  the 
second  class  are  leaky  modes  under  appropriate  conditions. 

It  is  not  generally  known  that  leaky  modes  can  exist  on  the  large  class  of  wave- 
guides denoted  above  as  the  second  class.  In  that  class,  the  thin  dielectric  film  serves 
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to  confine  the  guided  wave  laterally  to  the  region  of  the  strip.  Examples  of  such  wave- 
guides include  a dielectric  strip  with  refractive  index  less  than  that  of  the  film,1  index 
greater  than  that  of  the  film/  and  equal  to  that  of  the  film/  5 when  it  is  called  a "rib" 

waveguide.  Other  structures  employ  ion- implanted  layers/’^  or  metallic  coatings 

8-11 

everywhere  but  in  the  guiding  region,  and  are  known  as  "slot"  waveguides.  This 

class  of  waveguides  has  also  been  generically  referred  to  as  "slab-coupled"  guides. 

All  of  these  guiding  structures  have  been  built  and  tested  in  the  context  of  directional 
couplers,  modulators  or  switches. 

In  an  earlier  publication/  we  presented  a qualitative  explanation  for  the  leakage 
effects,  an  underlying  analysis  for  an  approximate  evaluation  of  this  leakage,  and  some 
numerical  results  for  the  rib  waveguide  for  some  specific  parameter  values.  We  are 
currently  performing  a more  accurate  analysis,  using  a rigorous  phrasing  of  the  prob- 
lem. In  this  report,  however,  we  are  concerned  with  how  to  predict  when  leakage  will 
occur  without  having  to  first  solve  the  problem  (which  is  quite  complicated).  In  this 
connection  we  present  a simple  but  accurate  criterion  which  is  based  on  the  known  dis- 
persion curves  for  the  constituent  portions  of  the  waveguide's  cross  section. 

It  is  important  for  two  reasons  to  know  whether  or  not  leakage  is  present.  Since 
these  optical  waveguides  are  intended  for  use  in  an  integrated  circuit  fashion,  unwanted 
leakage  can  cause  cross  talk  between  neighboring  components  and  thus  deteriorate  the 
performance  of  the  circuit.  On  the  other  hand,  novel  components  can  be  designed 
which  make  deliberate  use  of  the  leakage  present.  An  example  of  such  a component  for 
integrated  optics  is  a novel  leaky  wave  directional  coupler.1^  This  coupler  consists  cf 
the  usual  two  strip  waveguides  located  parallel  to  each  other  and  spaced  a certain  dis- 
tance apart,  but  here  the  spacing  is  made  so  great  that  the  coupling  would  be  negligible 
if  leakage  were  not  present.  It  is  found  that  the  coupling  is  not  sharply  sensitive  to  the 
separation  between  the  strips,  a feature  which  may  have  practical  import.  Of  even 
greater  interest  is  the  coupler's  potential  use  as  a mode  stripper  or  purifier.  In  gen- 
eral, when  one  wants  to  excite  the  TE  mode  on  the  optical  strip  guide  some  amount  of 
TM  mode  energy  will  inevitably  be  present.  Since.as  will  be  shown  below,  the  lowest 
TM  mode  on  the  optical  strip  waveguide  leaks  and  the  lowest  TE  mode  does  not,  such  a 
leaky  wave  coupler  can  be  used  to  tap  away  all  the  TM  mode  energy  without  disturbing 
the  TE  mode  at  all. 

A,  The  Physical  Basis  for  the  Leakage 

A typical  rib  waveguide  is  shown  in  Fig.  2;  the  material  of  the  strip  is  the  same 
as  that  of  the  thin  film.  The  strip  produces  a central  region  with  an  effective  refractive 
index  greater  than  that  of  the  surrounding  regions;  as  a result,  the  surface  wave  that 
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DIELECTRIC  STRIP 


DIELECTRIC  THIN  FILM 

Fig.  2.  Geometry  of  the  rib  waveguide.  The 
central  strip  and  the  surrounding  thin 
film  are  of  the  same  material. 

is  present  on  the  thin  film  is  pulled  in  by  the  strip  region  and  confined  to  its  neighbor- 
hood. The  guiding  process  is  usually  viewed  in  terms  of  waves  in  the  strip  region 
bouncing  back  and  forth  at  an  angle  between  the  two  sides  of  the  strip,  undergoing  total 
reflection  at  each  bounce. 

The  cross  section  is  considered  to  consist  of  two  constituent  regions,  the  inner 
strip  region  and  the  outer  portions.  Theoretical  analyses  for  the  propagation  charac- 
teristics of  strip  waveguides  have  appeared  in  the  literature/’  ^ but  these  anal- 

yses assume  the  presence  of  only  one  mode  type,  TE  or  TM,  in  each  of  the  constituent 
regions  comprising  the  cross  section  of  the  strip  waveguide.  When  the  field  behavior 
at  the  strip  sides  is  viewed  more  carefully,  however,  it  is  easy  to  see  that  a TE  or  TM 
mode  incident  on  a strip  side  produces  not  only  a reflected  and  transmitted  wave  of  its 
own  type,  but  also  excites  a reflected  and  a transmitted  wave  of  the  other  type,  plus  a 
continuous  spectrum  (which  is  purely  reactive  here).  This  coupling  at  the  strip  sides 
between  TE  and  TM  modes  produces  the  leakage  effects  discussed  here. 

If  we  assume  that  the  dielectric  film  thickness  is  such  that  the  constituent  regions 
in  the  cross  section  can  each  support  only  one  TE  mode  and  one  TM  mode  (which  cor- 
responds to  the  usual  range  of  operation),  then  a transverse  equivalent  network  for  the 
rib  waveguide  which  takes  into  account  the  TE- TM  coupling  mentioned  above  has  the 
form  seen  in  Figure  3.  The  transmission  line  portions  represent  the  independent  modes 
in  the  cross  section,  and  the  step  junction  discontinuities  which  correspond  to  the  strip 
sides  are  represented  by  the  boxes  which  couple  the  transmission  lines. 

Let  us  next  consider  separately  each  of  the  constituent  regions  comprising  the 
guide  cross  section,  as  if  each  were  infinitely  wide.  The  dispersion  curves  for  these 
regions  are  presented  in  Fig.  4 for  both  the  TE  and  TM  modes,  in  the  form  of  the  ef- 
fective refractive  index  n^  ( = p/k  = \/\  ) as  a function  of  t/\,  where  thickness  t is  de- 
fined in  the  inset  drawing.  The  TE  mode  is  seen  to  be  the  dominant  mode,  in  the  sense 
that  it  is  the  slower  mode  (higher  value  of  n ..)  and  it  has  the  smaller  cutoff  frequency 
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Fig.  3.  Transverse  equivalent  network  for  the  rib  waveguide  which 
takes  into  account  the  TE-TM  coupling  produced  at  the  strip 
sides.  The  symbols  i and  o signify  respectively  the  inner 
(strip)  and  outer  (film)  constituent  portions  of  the  waveguide 
cross  section. 


Fig.  4.  Dispersion  curves  for  the  basic  surface  waves 
which  exist  on  the  constituent  (strip  ancf  film) 
portions  of  the  rib  waveguide  cross  section,  if 
each  portion  is  considered  infinitely  wide. 

(or  thickness).  The  curve  for  the  other  polarization  (TM)  is  located  nearby.  The  verti- 
cal line  tj  in  Fig.  3 corresponds  to  the  (thicker)  strip  region,  and  the  line  t.,  to  the 
outside  (film)  regions. 

For  complete  guiding,  the  waves  bouncing  back  and  forth  inside  the  strip  region 
at  an  angle  to  the  strip  sides  must  be  propagating,  and  the  waves  present  outside  must 
be  transversely  evanescent.  In  terms  of  the  transverse  equivalent  network  of  Fig.  3, 
the  inner  TE  and  TM  transmission  lines  must  be  above  cutoff  and  the  outer  TE  and  TM 
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transmission  lines  must  be  below  cutoff.  Suppose  that  the  waveguide  is  excited  with  the 
electric  field  oriented  in  the  vertical  (y)  direction,  so  that  the  basic  excitation  is  TM. 
As  this  TM  wave  bounces  back  and  forth  inside  the  strip  region,  it  couples  some  TE 
energy  at  each  reflection  from  the  strip  sides.  If  no  TE  energy  were  coupled,  then  the 
inner  and  outer  TM  transmission  lines  of  Fig.  3 would  clearly  be  above  and  below  cut- 
off, respectively.  But,  we  may  observe  from  the  dispersion  curves  in  Fig.  4 that  the 
TE  wave  outside  (at  thickness  t^)  has  an  even  larger  value  of  n ^ than  does  the  TM 
wave  inside  (at  tj),  so  that  the  TE  wave  outside  is  seen  to  be  propagating  rather  than 
evanescent;  alternatively  phrased,  the  outer  TE  transmission  line  is  above  cutoff 
rather  than  below  cutoff.  Hence  the  condition  for  complete  guiding  is  not  satisfied,  and 
leakage  of  energy  transversely  away  from  the  waveguide  is  being  produced.  On  the 
other  hand,  if  the  initial  waveguide  excitation  were  of  the  TE  type,  a corresponding 
inquiry  would  indicate  that  the  resulting  guided  wave  would  be  completely  bound. 

The  leakage  produced  when  TM  excitation  is  incident  is  depicted  in  Figure  5.  A 


~1 ? 

Fig.  5.  Pictorial  representation  of  the  TE-TM 

coupling  at  the  strip  side,  which  produces 
leakage  when  the  TM  surface  wave  is 
incident  on  the  strip  side. 

TM  wave  incident  from  the  inside  excites  a reflected  propagating  TM  wave  inside,  an 
evanescent  TM  wave  outside,  and  propagating  TE  waves  both  outside  and  inside.  The 
propagating  TE  wave  outside  carries  away  a small  amount  of  energy  at  each  reflection, 
resulting  in  a leaky  mode  rather  than  a purely  bound  mode. 

Depending  on  how  close  the  dispersion  curves  of  Fig.  4 are  to  each  other,  the  TE 
wave  outside  may  or  may  not  be  evanescent,  and  the  resulting  guided  mode  may  or 
may  not  be  leaky.  Thus,  some  modes  may  be  leaky  or  not  depending  on  the  geometric 
parameters  of  the  waveguide.  We  have  here  indicated  qualitatively  how  leakage  can 
occur,  but  we  present  below  a criterion  for  leakage  which  permits  a quantitative  deter- 
mination of  when  leakage  is  present. 
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Fig.  6.  Dispersion  carves  for  the  first  class  of  rib  waveguides, 
for  which  value  ® lies  above  values  © and  CD,  so  that 
leakage  occurs  for  all  values  of  strip  width. 

unconditionally,  so  that  leakage  in  that  mode  will  be  present  for  all  values  of  strip 
width  W. 

Dispersion  curves  for  the  second  of  the  two  cases  are  given  in  Figure  7.  The 


Fig.  7.  Dispersion  curves  for  the  second  class  of  rib  waveguides, 
for  which  value  ® lies  between  values  © and  © . For 
this  case,  leakage  will  occur  only  for  sufficiently  narrow 
strips . 

designations  ®,  © and  © mean  the  same  thing  here  as  in  Fig.  6,  but  we  observe  here 
that  value  © lies  between  values  © and  © and  not  above  them  both,  as  in  the  first 
case.  Hence  we  conclude  that  leakage  will  no  longer  occur  for  all  strip  widths,  W.  In 
order  to  determine  for  what  values  of  W leakage  will  be  present,  we  must  solve  for 

("eff) guide  a®  a function  of  w- 

Thus,  inequality  ® is  not  helpful  as  it  stands  because  k (or  (n  „)  J ) is  not 

z eff  guide 

known  until  the  complete  problem  is  solved.  However,  we  may  employ  an  approxima~ 
tion  to  k^  to  provide  us  with  a useful  criterion.  The  so-called  equivalent  index  method/ 
which  is  a modification  of  the  Marcatili  approach/"*  or  the  equivalent  dielectric  constant 
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B.  A Simple  Quantitative  Criterion  for  Leakage 


With  reference  to  the  coordinate  system  indicated  on  Fig.  2,  the  explicit  condi- 
tion for  leakage  is 


(kTE)2  > 0 

' x 'out 


(1) 


T'r 

consistent  with  the  qualitative  discussion  above.  The  quantity  (k^  )out  is  the  trans- 


verse wavenumber  of  the  TE  wave  in  the  outside  region,  and  relation  (1)  states  that  the 
outer  TE  transmission  line  in  the  transverse  equivalent  network  in  Fig.  3 is  above  cut- 
off. 


npp 

The  actual  TE  wave  outside,  with  wavenumber  (k  ) „ where  s represents 

s out  r 

"surface  wave,  " will  be  propagating  away  from  the  strip  at  some  angle,  and  it  will 


TE 

possess  components  k^  along  the  waveguide  direction  and  (k^  ) perpendicular  to  it 


in  the  plane  of  the  thin  dielectric  film.  Quantity  k needs  no  other  qualifying  indices 


because  all  constituents  of  the  net  guided  mode  along  the  waveguide  must  possess  the 


same  k value.  We  may  thus  write 


TE.  2 


(k“J|1'  = k2  + (kTE)2 

s 'out  Z ' X 'out 


(2) 


so  that 


(kTE)  > k 
s 'out  z 


(3) 


or,  dividing  by  k. 


^neff  ^out  > ^"eff^guide 


(4) 


in  the  light  of  relation  (1). 


The  range  of  possible  rib  waveguide  geometries  falls  into  two  cases.  The  dis- 


persion curves  for  the  first  of  these  cases  appear  in  Figure  6.  The  value  of  n ^ indi- 


cated  as  (J)  corresponds  to  (neff  )out*  Furthermore,  ( ne£f )gU^cje  wiU  vary  between  the 
values  denoted  by  (2)  and  ©,  although  it  may  not  follow  the  TM  curve  itself.  The 
validity  df  that  statement  is  easily  verified.  When  the  strip  width  W becomes  extremely 
large,  the  inside  region  of  thickness  t}  dominates  the  geometry,  and  the  value  of 
(neff )gUide  must  approach  value  ©.  (Recall  that  we  are  discussing  the  case  of  TM  ex- 
citation.) When  W becomes  zero,  the  outside  region  of  thickness  t^  is  present  every- 
where, and  the  value  of  (neff)gUjcje  must  reduce  to  (3).  For  this  case,  therefore,  we 
observe  that  value  ® is  higher  than  values  (D  and  Q) , i.e.,  condition  @ is  satisfied 
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method/^’  which  is  the  first  step  of  a transverse  resonance  procedure,  can  supply 
us  with  an  approximation  which  is  reasonably  accurate  and  yet  simple.  In  both  of  these 
approximate  procedures,  the  TE-TM  coupling  is  neglected  entirely;  as  a result,  neither 
method  can  predict  any  leakage,  but  the  k^  values  obtained  are  reasonably  reliable 
under  most  conditions. 

As  an  approximation,  therefore,  we  take 


k = (k  ) 
z z approx. 


(5) 


where  the  k^  found  will  be  purely  real  and  corresponds  to  the  case  of  TM  waves  alone. 
Then,  using  Eqs.  (3)  or  (4)  we  write 


(kTE)  . > (k  )™ 
s 'out  z approx. 


, TE.  . . TM. 
eff  'out  eff  guide 

approx. 

Now,  a plot  of  (n^1) 

r 'eff  'guide 


(6) 


(7) 


as  a function  of  strip  width  W can  be  obtained  rela- 


approx. 

tively  easily,  avoiding  the  complications  introduced  by  TE-TM  coupling  and  the  continu- 
ous spectrum  contributions.  A typical  such  plot  for  the  dispersion  curve  features  ex- 
hibited in  Fig.  7 is  shown  in  Figure  8.  As  seen,  consistent  with  condition  (7),  the  rib 


Fig.  8.  Plot,  as  a function  of  strip  width,  of  the  effective 
refractive  index  for  the  mode  guided  along  the  rib 
waveguide  for  TM  excitation,  for  the  case  consid- 
ered in  Fig.  7,  where  the  effective  index  values 
are  calculated  approximately.  Leakage  is  seen  to 
occur  only  for  the  narrower  strip  widths. 


waveguide  will  leak  only  for  W<  WQ,  where  Wq  is  the  crossing  point  indicated  in  Figure 
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8 Leakage  for  TM  excitation  is  thus  seen  to  occur  for  this  case  only  for  small  strip 

widths.  It  is  therefore  possible  to  design  the  waveguide  geometry  to  avoid  leakage  al- 
together  if  one  so  wishes. 

If  TE  excitation,  rather  than  TM  excitation,  i.  incident  on  th.  rib  waveguide,  then 
the  condition  corre.ponding  to  relation  (7,  for  TM  excitation  would  become 


. TM. 
neff  out 


„ t TE. 

' eff  'guide 


(8) 


« tht.  inequality  w.re  ..ti.fi.d,  then  the  TE  guided  mode  would  leak  via  ,h.  energy 
coupled  a,  the  atrip  aide,  to  the  TM  wave,  te  term,  of  the  di.per.ion  plot  of  the  type 

TX‘"  **'  F‘8'  V“,'Je  ® WOU“  now  'orrespond  to  <n™|  ,„d  values  © 

and  © would  be  tho.e  on  the  TE  curve  a.  th.ckn,....  , and  , restively.  Und.r 

tto.e  condition.  w.  observe  that  ® would  always  be  lower  than  ®,  ,0  that  condition 

(8)  would  never  be  satisfied.  Thu.,  since  the  TE  .urface  wav.  i.  ,h.  dominant  mode 

m dies,  di.per.ion  plots,  th.  TE  mod.  on  the  rib  waveguide  would  nevlT^T no  matte, 
what  the  value  of  strip  width. 

— Application  to  Rib  Waveguide  Devices 

Th.  rib  waveguide  .tructure  was  devised  by  the  Bell  Laboratory,  and  then  em- 
ployed by  them  ...eral  device.,  .uch  a.  modulator,  and  .witches,  I„  mea.ure- 

men„.  they  did  not  detect  th,  presence  of  leakage  from  th,  waveguide,  they  employed 
n an  attempt  to  v.r.fy  th.i,  results,  w.  applied  ,h.  criteria  for  leakage  pre.en.ed  above 
to  several  rib  waveguides  which  they  used  in  their  devices.18 

All  of  their  measurements  were  taken  at  the  wavelength  X = 0.8966p.  Six  different 
waveguides  were  measured,  but  the  refractive  indices  for  all  were  the  same: 


nf 

n 

s 

n 


3.  438 1 , film 

3.  3842,  substrate 

1.720  , superstrata  (not  air). 


diffe  ed  r~  WaVeeUld'!*  had  th'  ,U“  thicknesses  (or  heigh..,,  ,„d 

differed  only  strip  width,  the  .ec„„d  set  of  three  waveguide,  had  th.  .am.  strip 

21  I " I b”‘  ,Um  *"<'  ““<■  thicknesses.  Thus,  using  noU. 

ti°n  ln  Ei«-  2,  we  have  for  the  first  set 

tj  = 0.800p  , t2  = 0.  730 p 

and  for  the  second  set 


tj  = 0. 700p  , t2  = 0. 630p. 

For  each  set,  the  three  strip  widths  were 

W1  = 3.50p  , = 3 . OOp  , W3  = 2.50ji. 
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Before  we  can  apply  the  criteria  for  leakage  developed  here,  we  must  have  avail- 
able the  dispersion  curves  for  the  TE  and  TM  modes  in  the  strip  and  film  regions  sep- 
arately, as  if  each  region  were  infinitely  wide.  We  thus  would  have  one  pair  of  curves 
for  the  first  set  of  strip  and  film  thicknesses,  and  another  pair  for  the  second  set,  as 
shown  respectively  in  Figures  9 and  10.  On  these  curves  we  have  indicated  the  values 
0,  © and  (D,  corresponding  to  those  in  Figures  6 and  7.  We  observe  immediately 
that  in  both  Figs.  9 and  10  value  0 lies  between  values  © and  0,  as  in  Fig.  7,  indi- 
cating that  the  TM  mode  on  these  rib  waveguides  will  leak  only  for  sufficiently  narrow 
strip  widths,  but  not  for  all  strip  widths. 

The  next  step  is  to  determine  the  range  of  strip  widths  for  which  leakage  will  oc- 

TM 

cur.  Toward  this  end,  so  that  criterion  (7)  can  be  used,  we  have  calculated  (n^  )gUicje 
as  a function  of  strip  width  W for  each  set  of  thicknesses  (corresponding  to  Figs.  9 and 
10)  using  the  approximate  equivalent  dielectric  constant  method,  which  is  a simple 
procedure  corresponding  to  the  first  step  in  a transverse  resonance  approach  which 
neglects  the  TE-TM  coupling  and  the  continuous  spectrum  contributions.  These  ap- 
proximate calculations  are  presented  in  Figs.  11  and  12,  where  the  curves  shown  rep- 
resent the  two  lowest  modes,  due  to  TM  excitation,  guided  by  the  rib  waveguide. 

The  range  of  W over  which  leakage  occurs  is  indicated  in  Figs.  11  and  12  by  the 
dashed  lines,  and  W^  is  the  largest  width  for  which  leakage  can  be  present.  It  is  seen 
that  for  five  of  the  six  waveguides  the  strip  width  is  larger  than  W^ , so  that  leakage 
would  not  be  expected.  For  the  sixth  waveguide,  the  value  of  occurs  very  close  to 
Wq,  right  at  the  edge  of  the  leakage  range.  Since  the  rate  of  leakage  must  come  to 
zero  at  the  edge  of  the  range,  the  leakage  rate  for  that  waveguide  must  necessarily  be 
quite  small  --  difficult  to  detect  experimentally  and  probably  negligible  with  respect  to 
any  practical  consequence. 

If  the  strip  width  were  made  narrower,  leakage  would  then  be  possible.  However, 
this  criterion  indicates  only  whether  or  not  leakage  will  occur  in  principle;  it  says 
nothing  about  the  amount  of  leakage.  Depending  on  the  geometrical  parameters,  the 
amount  of  leakage  may  be  significant,  or  so  small  as  to  be  of  no  practical  import.  The 
effective  discontinuity  provided  by  the  sides  of  the  strip  is  the  key  feature,  because  that 
is  where  the  coupling  occurs  between  the  TE  and  TM  modes.  If  the  effective  coupling 
is  too  small,  the  amount  of  leakage  will  be  miniscule  and  the  effect  unimportant.  How- 
ever, it  is  necessary  to  solve  the  complete  boundary  value  problem,  including  the  TE- 
TM  coupling,  before  one  can  determine  the  leakage  rate  quantitatively  --a  short  cut  of 
the  type  presented  here  cannot  be  developed. 

Joint  Services  Technical  Advisory  Committee 

F 446 20-  74- C- 0056  A.  A.  Oliner 


OPTICS 


REFERENCES 

1.  H.  Furuta,  H.  Noda  and  A.  Ihaya,  "Novel  Optical  Waveguide  for  Integrated  Optics,  " 
Appl.  Opt.,  13,  pp.  322-326  (February  1974). 

2.  N.  Uchida,  "Optical  Waveguide  Loaded  with  High  Ref ractive- Index  Strip  Film,  " 
Appl.  Opt.,  15.  pp.  179-182  (January  1976). 

3.  J.E.  Goell,  "Rib  Waveguides  for  Integrated  Optical  Circuits,"  Appl.  Opt.,  12, 
pp.  2797-2798  (December  1973). 

4.  F.K.  Reinhart,  R.A.  Logan  and  T.  P.  Lee,  "Transmission  Properties  of  Rib 
Waveguides  Formed  by  Anodization  of  Epitaxial  GaAs  on  AlxGai-xAs  Layers,  " 

Appl.  Phys.  Lett.,  2_4,  pp.  270-  272  ( 1 5 March  1974). 

5.  S.T.  Peng  and  A.  A.  Oliner,  "Leakage  and  Resonance  Effects  on  Strip  Waveguides 
for  Integrated  Cptics,  " Trans.  Inst,  of  Electronics  and  Communication  Engineers 
of  Japan,  Special  Issue  on  Integrated  Optics  and  Optical  Fiber  Communications, 

E6 1 , pp.  151-154  (March  1978). 

6.  R.  Th.  Kersten  and  H.  Boroffka,  "Ion  Implantation  Into  Fused  Quartz  for  Integrated 
Optical  Circuits,"  Optics  Comm.,  _L7,  pp.  119-123  (April  >976). 


7.  R.  Th.  Kersten,  "A  Directional  Coupler  with  Large  Waveguide  Separation,  " Optics 
Comm.,  _1_7,  pp.  124-  128  (April  1976). 

8.  A.  A.  Oliner,  "Optical  and  Acoustical  Microelectronics:  Similarities  and  Differ- 
ences, " Proc.  Symp.  Optical  and  Acoustical  Micro- Electronics,  Polytechnic  Inst, 
of  N.  Y.  , pp.  1-17  (especially  12,  13)  (April  1974). 

9.  J.  Hamasaki  and  K.  Nosu,  "A  Partially  Metal- Clad-Dielectric-Slab  Waveguide  for 
Integrated  Optics,"  IEEE  J.  Quantum  Electron.,  _1_0,  pp.  822-825  (October  1974). 

10.  Y.  Yamamoto,  T.  Kamiya  and  H.  Yanai,  "Propagation  Characteristics  of  a Partial- 
ly Metal- Clad  Optical  Guide:  Metal-Clad  Optical  Strip  Line,"  Appl.  Opt.,  _L4,  pp. 
322-326  (February  1975). 

11.  A.  A.  Oliner  and  S.T.  Peng,  "Waveguides  for  Integrated  Optics  Formed  by  Metal 
Platings,"  Proc.  Sixth  European  Microwave  Conf.  , pp.  499-503,  Rome,  Italy 
(September  1976). 

12.  E.A.J.  Marcatili,  "Slab- Coupled  Waveguides,"  Bell  Syst.  Tech.  J.  , 53,  pp.  645- 
674  (April  1974). 

13.  E.W.  Hu,  S.T.  Peng  and  A.  A.  Oliner,  "A  Novel  Leaky- Wave  Strip  Waveguide 
Directional  Coupler,  " Topical  Meeting  on  Integrated  and  Guided  Wave  Optics,  Paper 
No.  WD2,  Salt  Lake  City,  Utah  (January  1978). 

14.  V.  Ramaswamy,  "Strip- loaded  Film  Waveguides , " Bell  Syst.  Tech.  J.,  53,  p.  697 
(April  1974). 

15.  E.A.J.  Marcatili,  "Dielectric  Rectangular  Waveguide  and  Directional  Coupler  for 
Integrated  Optics,"  Bell  Syst.  Tech.  J.,  48,  pp.  2071-2102  (September  1969). 

16.  A.  A.  Oliner  and  S.T.  Peng,  "Optical  Slot  and  Strip  Waveguides  Formed  by  Metal 
Plalings,  " Digest  of  Topical  Meeting  on  Integrated  Optics,  Paper  MC5,  Salt  Lake 
City,  Utah  (January  1976). 

17.  W.V.  McLevige,  T.  Itoh  and  R.  Mittra,  "New  Waveguide  Structures  for  Millimeter 
Wave  and  Optical  Integrated  Circuits,"  IEEE  Trans.  Microwave  Theory  Tech., 
MTT-23.  pp.  788-794  (October  1975). 

18.  The  rib  waveguide  parameter  values  were  kindly  furnished  to  us  by  F.K.  Reinhart 
and  J.C.  Shelton,  Bell  Laboratories,  Murray  Hill,  N.J. 


OPTICS 


113 


MODULATION  SENSITIVITY  OF  ELECTRO-OPTIC  SLAB  WAVEGUIDES 
S.  T.  Peng 

The  advantage  of  economy  in  power  and  size  afforded  by  thin-film  waveguiding 

structures  for  electro-optic  and  acousto-optic  modulation  and  beam  deflection  has  been 
12  3 

well  recognized.  ’ ’ The  central  question  concerning  such  devices  is:  what  is  the 
incremental  phase  change  of  a confined  mode  under  the  action  of  an  applied  electric  or 

acoustic  field?  Such  a question  was  first  explored  by  Lotspeich  for  a single  layer 

1 2 3 

structure  and  the  results  were  extended  by  Buckman  for  multilayer  structures.  ’ 

These  previous  analyses  were  primarily  based  on  a numerical  analysis  which  is  time 
consuming  and,  more  importantly,  lacks  physical  insight  into  the  wave  phenomenon 
associated  with  the  devices.  So  far,  no  systematic  design  criterion  has  been  put  forth 
in  the  literature.  In  this  work,  we  have  adopted  a new  approach  and  successfully  de- 
rived simple  analytic  formulas  for  the  modulation  sensitivity  of  a structure  with  any 
number  of  layers.  As  an  illustration,  the  results  for  a TE  mode  in  a single  layer  struc- 
ture are  given  in  what  follows. 

A thin  film  electro-optic  modulator  is  shown  schematically  in  Figure  1.  The 
electro-optic  material  of  refractive  index  n^  and  thickness  t is  sandwiched  between 


Fig.  1.  Thin  film  electro-optic  modulator. 

two  uniform  media  of  refractive  indices  and  n^,  respectively.  Without  loss  of 
generality,  the  refractive  indices  may  be  normalized  such  that  n =1,  and  the  space 
above  the  film  may  then  be  referred  to  as  the  air  region  and  the  space  below  as  the 
substrate  region.  Therefore,  the  subscripts  a,  f,  and  s designate  the  air,  film  and 
substrate  regions,  respectively.  For  the  confinement  of  optical  energy  within  the  thin 
film,  the  refractive  indices  must  satisfy  n^>ng>na. 

It  is  well  known  that  the  propagation  characteristic  of  surface  waves  along  the 
thin  film  are  completely  determined  by  a dispersion  relation  (or  eigen  equation)  of 
the  electromagnetic  boundary  value  problem.  It  can  be  shown  that  the  dispersion 
relation  may  be  cast  into  many  different  mathematical  forms;  with  physical  insight 
into  the  basic  surface-wave  phenomenon  we  have  observed  that  one  of  them  is  parti- 
cularly useful  for  the  study  of  electro-optic  modulators.  Such  an  observation  leads 
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to  the  derviation  of  simple  analytic  formulas  by  which  the  analysis  and  design  of  electro- 
optic modulators  can  be  easily  performed. 

The  variable  of  interest  here  is  the  modulation  sensitivity  of  the  active  thin  film 
waveguide  structure:  ’ S = dg^/dkfl  where  g^  is  the  propagation  constant  of  a guided 
surface -wave  mode  and  kf  is  the  plane  wave  propagation  constant  of  the  electro-optic 
medium.  If  we  define  the  effective  index  of  refraction: 


n = g /k 
g g'  o 


(1) 


where  kQ  is  the  plane  wave  propagation  constant  in  free  space,  we  then  have  the  modu- 
lation sensitivity: 


dg  dn 

s = — s.  = — s. 

dkj.  dn^. 

for  which  we  have  made  use  of  the  relations; 
kr  = kQ  nr  . for  r = a,  f,  and  s . 


(2) 


(3) 


We  observe  that  instead  of  the  indices  of  refraction,  it  will  be  more  convenient  to  ana- 
lyze the  electro-optic  modulators  in  terms  of  the  dielectric  constants: 

2 


e = n 
r r 


(4) 


where  r may  stand  for  not  only  a,  f,  and  s,  but  also  g (for  guided  modes).  From  Eqs. 
(2)  and  (4),  the  modulation  sensitivity  may  be  written  alternatively  in  the  form: 


d n n,  de 

s = -_S  = _£  —S. 

dn,  n de, 

f g f 


(5) 


For  the  structure  shown  in  Fig.  1,  it  is  well  known  that  ng  < n < nf.  Consequently, 
we  have : 


nf  nf 

1 < _L  < _£ 

n — n 
E s 


(6) 


Therefore,  the  first  factor  of  the  modulation  sensitivity  given  by  Eq.  (5)  is  always  greater 
than  unity  and  is  bounded  above  by  the  ratio  n^/n^.  We  have  derived  simple  analytic 
expressions  for  the  other  factor,  de  /de  , for  both  TE  and  TM  polarizations.  For 
example,  in  the  case  of  an  active  layer  in  free  space  ( ng  = na)>  we  have,  for  the  TE 
polarization: 


de 

jn*  = 1 -f(c  ) 
def  g' 


(7) 


T 


with 


cos2[k0tVcf-cg] 

1 + k t It  - e 
o V g a 
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(8) 


where  f as  a function  of  is  obviously  bounded  below  and  above  by: 

0 < fUg)  1 1 ' 

for  any  guided  TE  mode  of  the  structure  under  consideration.  Evidently,  it  then  follows 
from  Eqs.  (7)  and  (9)  that 


de 

0 < -5-S-  < 1 . (10) 

- d£f 


It  is  noted  that  the  equality  sign  in  Eqs.  (5),  (8),  and  (9),  holds  if  and  only  if  a mode  is 
right  at  the  cutoff  condition  ( n^  = na)'  Although  analytic  expressions  for  the  factor 
dc  /dej  become  more  complicated  if  the  substrate  is  present,  the  inequalities  in  Eq. 

(10)  hold  in  any  case.  In  view  of  Eqs.  (6)  and  (10),  it  is  evident  from  Eq.  (5)  that  we 
have: 

0 < S < n / n (11) 

— is 

Again,  the  equality  sign  holds  only  at  the  cutoff  condition.  Therefore,  we  can  now  draw 
the  conclusion:  for  a single  layer  of  electro-optic  material  on  a substrate,  the  modu- 
lation sensitivity  can  never  be  equal  to  or  exceed  the  ration  n ng. 

1 -3 

It  is  noted  that,  in  contrast  to  the  published  approached  based  on  a dispersion 
relation  in  terms  of  phase  functions,  our  analysis  is  based  on  one  in  terms  of  impedance 
function.  The  advantage  of  this  different  approach  is  that  it  yields  simple  analytic  ex- 
pressions for  the  modulation  sensitivity  for  multilayer  structure;  this  opens  the  way 
for  an  optimized  synthesis  procedure  for  thin-film  modulators. 
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RAY  ANALYSIS  OF  UNSTABLE  RESONATORS 
S.  H.  Cho  and  L.  B.  Felsen 

The  ray  analysis  of  high-frequency  propagation  and  scattering  problems  is  suc- 
cesful  especially  when  the  required  number  of  constituent  ray  fields  is  not  very  large. 
This  is  often  the  case  when  an  impenetrable  scatterer  is  in  a free  space  environment 
since  multiple  diffraction  phenomena  between  scattering  centers,  if  these  exist,  de- 
crease rapidly  in  intensity.  However,  when  the  propagation  environment  includes 
multiple  boundaries,  between  which  fields  can  be  reflected  an  arbitrarily  large  number 
of  times  without  substantial  loss,  the  resulting  "ray  series"  becomes  cumbersome  and 
inaccurate.  Therefore,  the  success  of  the  ray  method  in  the  presence  of  multiple  re- 
flectors depends  on  one's  ability  to  treat  all  higher  order  reflected  rays  collectively 
so  that  the  ray  series  can  be  truncated  at  a tractable  limit.  If  the  reflector  surfaces 
are  infinite  in  extent,  thereby  forming  a waveguide  or  duct,  the  propagating  fields  can 
alternatively  be  expressed  in  terms  of  modes  guided,  loosely  speaking,  ir»  a direction 
parallel  to  the  boundaries.  Guided  modes  can  be  represented  in  terms  of  modal  ray 
congruences  (families  of  rays)  and  the  associated  wavefronts  (equiphase  surfaces), 
one  upgoing  and  one  downgoing,  such  that  the  boundary  conditions  on  the  reflector 
surfaces  are  satisfied  (Fig.  L).  To  represent  a modal  field,  these  ray  congruences 


CENTER  OF  CURVATURE 


Fig.  1.  Ray  congruences  for  self-consistent  fields 
between  infinite  hyperbolic  mirrors. 
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must  be  self-consistent  in  the  sense  that  the  upgoing  congruence,  on  reflection  at  the 
upper  boundary,  is  converted  into  the  downgoing  congruence  which,  after  reflection  at 
the  lower  boundary,  again  becomes  the  original  upgoing  congruence.  Between  curved 
boundaries,  these  congruences  are  generated  by  "modal  caustics"  whose  location  is 
such  as  to  meet  the  self-consistency  constraint.  The  self-consistent  caustics  furnish, 
equivalently,  the  eigenvalues  for  the  guided  modes.  A special  example  of  these  modal 
congruences  and  wavefronts  in  the  unstable  resonator  environment  is  furnished  by 
Siegman's2  two  spherical  waves  emanating  from  the  foci  of  an  elliptic  coordinate  system 
when  the  boundaries  are  infinite  hyperboloidal  mirrors;  since  the  mirrors  are  actually 
truncated  near  the  apex,  they  can  be  approximated  there  by  spherical  shapes. 

The  ability  to  represent  fields  between  infinite  mirrors  by  two  modal  ray  con- 
gruences suggests  that  source  or  diffraction  excited  ray  fields  with  many  reflections 
may  be  combined  collectively  into  modal  congruence  form.  Our  confirmation  of  this 
conjecture  has  rendered  the  ray-optical  treatment  of  the  unstable  resonator  feasible. 

The  analysis  proceeds  by  assuming  an  incident  ray  field  (modal  congruence)  appropriate 
to  the  infinite  mirror  structure.  When  the  mirrors  are  truncated,  this  incident  field 
excites  the  mirror  edges  which  thereby  become  scattering  centers  for  GTD  (geometrical 
theory  of  diffraction)  edge  diffracted  rays.  These  rays  propagate  back  into  the  resonator 
via  direct  (Fig.  2a)  and  multiply  reflected  paths  (Fig.  2b),  with  amplitudes  and  phases 
determined  according  to  the  rules  of  GTD;  they  modify  the  incident  field  on  the  mirrors. 


diffracted  rays 


Fig.  2a.  Diffraction  of  incident  modal  field. 
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Fig.  2b.  Multiple  reflection  of  primary  diffracted  field 
Although  a ray  can  undergo  an  arbitrarily  large  number  of  reflections  between  the  mir- 
rors before  reaching  an  observation  point,  the  collective  behavior  of  all  high-order 
reflected  rays  with  more  than  N reflections  reduces  to  (2N+1)  the  number  of  rays 
reaching  an  observation  point.  The  choice  of  N is  not  critical  but  must  be  large  enough 
to  meet  the  "collective  (i.e.,  modal)  ray"  criterion. 

When,  after  traversing  the  resonator,  each  of  the  multiply  reflected  edge  dif- 
fracted rays  impinges  on  the  edges  along  its  geometrically  determined  incidence  angle 
(Fig.  2c),  each  ray  excites  (2N+1)  secondary  edge-diffracted  rays.  These  rays  propa- 
gate through  the  resonator  like  the  primary  diffracted  rays  and  provide  secondary 

^FOCUS 

\ 

\ 

\/\\  If* 

/ \ \\  I 


- 2N  + I RAYS 
Gi*  COLLECTIVE  RAYS 


X»l  OBSERVATION  POINT 

Fig.  2c.  Total  field  at  edge  after  primary  diffraction, 
incident  fields  at  the  edges,  along  (2N+1)  trajectories  whose  directions  coincide  with 
those  of  the  previous  diffraction  order.  The  same  comment  applies  to  any  higher  order 
order  of  diffraction.  At  this  stage  of  the  analysis,  one  may  seek  resonant  field  solu- 
tions by  demanding  that  the  edge  excitation  for  two  successive  orders  of  diffraction 
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are  the  same  (Fig.  3).  To  satisfy  this  self-consistency  requirement,  which  leads  to 
a resonance  equation,  it  is  necessary  to  adjust  the  incidence  angles  of  the  (2N+1)  rays 
at  the  edges,  including  that  of  the  collective  modal  ray.  This,  in  turn,  implies  the 
proper  selection  of  the  modal  caustic  parameter,  which,  in  fact,  becomes  the  eigen- 
value of  the  resonant  mode.  After  the  eigenvalues  have  been  found  from  solution  of 
the  resonance  equation,  the  corresponding  resonant  modal  field  is  obtained  as  the 
sum  of  the  (2N+1)  multiply  reflected  ray  fields. 
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Fig.  3.  Resonance  condition. 

We  have  applied  the  above  procedure  to  symmetrical  strip  and  circular  mirror 
resonators.  After  making  an  approximation,  our  ray-optically  derived  resonance 
condition  can  be  simplified  to  agree  completely  with  that  derived  by  Horwitz3  from  a 
postulative  approach  based  on  an  asymptotic  treatment  of  the  resonator  integral  equa- 
tion. Since  the  approximations  made  at  various  stages  of  the  ray-optical  analysis  have 
been  stated  carefully,  it  is  possible  to  identify  what  must  be  done  if  the  present  results 
are  to  be  improved.  Moreover,  the  direct  comparison  of  quantities  and  mathematical 
relations  appearing  in  our  study  with  those  of  Horwitz  permits  the  deductive  under- 
standing of  Horwitz's  postulative  approach  and  provides  a physical  basis  for  his 
mathematically  convenient  assumptions  at  various  stages.  A comparison  has  also 
been  made  with  the  resonance  equation  obtained  by  the  waveguide  technique  of  Santana 
and  Felsen,  and  it  has  been  shown  that  approximate  analytical  solutions  obtained  by 
these  two  different  procedures  for  the  lowest  mode  eigenvalues  at  adequately  large 
Fresnel  numbers  are  in  agreement.  By  combining  a collective  with  an  individual  treat- 
ment of  ray  contributions,  the  present  method  thus  furnishes  a link  between  the  ray- 
optical  and  waveguide  schemes. 
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The  complex  eigenvalues  and  the  corresponding  eigenmode  fields  calculated  from 
equations  derived  by  the  Horwitz  method  have  been  found  to  agree  well  with  those  gen- 
erated by  direct  numerical  solution  of  the  resonator  integral  equation.  The  importance 
of  the  present  study  is  not  limited  to  providing  a deductive  derivation  of  these  earlier 
results  for  idealized  two-dimensional  and  three-dimensional  shapes.  Because  of  the 
generality  of  the  ray  method,  which  can  account  for  imperfect  mirror  geometries  and 
for  propagation  phenomena  ascribed  to  local  medium  inhomogeneities,  it  is  suggestive 
that  our  treatment  may  be  extended  to  more  complicated  resonator  configurations,  and 
also  to  resonators  loaded  with  active  media  that  may  be  locally  non-uniform.  These 
aspects  are  presently  under  consideration.  The  route  to  such  extensions  is  not  readily 
perceived  by  Horwitz's  postulated  functional  expansion  approach. 
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SMALL  MISALIGNMENT  EFFECTS  IN  UNSTABLE  RESONATORS 
C.  Santana  and  L.B.  Felsen 

A recently  published  communication*  contains  a numerical  study  of  the  integral 
equation  for  unstable  strip  resonators  with  misaligned  sharp- edged  circular  mirrors. 
The  results,  which  cover  only  a limited  range  of  Fresnel  numbers,  indicate  that  even 
very  small  misalignments  can  have  a marked  effect  on  the  resonant  mode  loss  behavior. 
In  particular,  the  misalignment  leads  to  mode  detachment  av  low  Fresnel  numbers  and 
produces  additional  oscillations  in  the  eigenvalue  curve.  The  regularity  of  these  oscil- 
lations led  the  authors*  to  suggest  that  this  behavior  would  be  valid  also  for  other  para- 
meter values,  and  to  indicate  a criterion  for  choosing  Fresnel  numbers  that  minimize 
the  misalignment  effects.  We  show  here  that  when  the  eigenvalues  are  examined  over 
a broader  range  of  Fresnel  numbers,  the  conclusions  based  on  the  data  in  Ref.  1 are 
misleading.  As  seen  from  Fig.  1,  which  contains  our  data  (solid  curve)  and  those  from 
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EQUIVALENT  FRESNEL  NUMBER 

Fig.  1.  Eigenvalue  behavior  for  lowest  order  detached  mode.  Misalignment 
parameter  t =0.01262  (see  Ref.  1).  The  eigenvalues  are  normalized 
to  the  geometric  optical  value  for  infinite  mirrors. 
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Ref.  1 (shown  as  crosses),  the  eigenvalue  plot  loses  its  regularity  and  exhibits  changed 
periodicities.  These  features  can  be  explained  physically  by  the  alternative  method, 
summarized  below,  that  was  used  in  our  analysis.  We  have  also  calculated,  and  shall 
interpret,  data  for  larger  misalignments  than  those  given  in  Reference  1. 

The  solid  curve  in  Fig.  1 was  produced  by  solution  of  the  resonance  equation 

‘1  I >1  * t R^0,rnn-  i]  [ ^e|F„-  i]  = o HI 

which  is  obtained  when  the  resonator  is  viewed  as  a waveguide  in  the  direction  trans- 

2 

verse  to  its  axis.  In  deriving  Eq.  (1),  the  tilted  mirror  geometry  is  first  mapped  into 

a configuration  with  non-tilted  mirrors,  which  are  asymmetrical  with  respect  to  the 

resonator  axis.  In  this  configuration,  R and  T identify  the  modal  reflection  coef- 
2 n nn 

ficients  due  to  the  modal  caustic  at  the  center  of  the  resonator  and  the  mirror  edges, 
respectively,  with  the  arrows  indicating  the  two  possible  transverse  directions  seen 
from  a preselected  reference  point.  The  (e)  and  (o)  superscripts  refer  to  modal  fields 
with  even  and  odd  symmetry  about  the  resonator  axis;  these  fields  are  coupled  in  the 
asymmetrical  structure.  Analytical  expressions  for  the  above  quantities  are  derived 
by  generalization  of  results  presented  previously  for  the  perfectly  aligned  symmetrical 
configuration,  where  the  even  and  odd  modes  exist  independently.  A simplified  version 
of  the  full  resonance  equation  in  (1)  derived  by  us  provides  basic  insights  into  the  phys- 
ical mechanisms  responsible  for  the  eigenvalue  behavior. 

The  resonance  equation  in  ( 1 ) can  actually  accommodate  misaligned  mirrors  of 
unequal  size  and  radius,  and  with  sharp  or  softened  edges.  The  present  example 
constitutes  a special  case  of  this  general  theory,  and  its  agreement  with  the  independent- 
ly obtained  results  in  Ref.  1,  over  the  limited  range  available,  serves  as  further  con- 
firmation of  its  validity. 
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PROPAGATION  IN  INHOMOGENEOUS  SLAB  WAVEGUIDES  --  EXACT  SOLUTIONS 
E.  Navon  and  L.  B.  Felsen 

A.  Introduction 

Slab  waveguides  with  an  inhomogeneous  dielectric  profile  embedded  in  a homoge- 
neous environment  find  application  in  integrated  optics,  and  they  also  serve  as  proto- 
type models  for  graded  index  optical  fibers.  Solutions  for  the  modal  fields  and  modal 
propagation  coefficients  in  these  configurations  must  generally  be  determined  by  ap- 
proximate methods  suitable  for  the  high-frequency  range.  In  order  to  check  the  validity 
of  these  methods  and  the  accuracy  of  the  results,  it  is  important  to  have  exact  solutions 
for  special  profiles,  available  for  comparison.  Here,  we  examine  two  such  special 
profiles,  the  parabolic  and  hyperbolic  tangent.  The  formulation  and  analysis  is  per- 
formed in  such  a manner  that  the  procedure  becomes  applicable  also  to  more  general 
profiles,  which  are  to  be  treated  by  the  recently  developed  evanescent  wave  tracking 
method.  This  method  has  been  shown  to  yield  high  frequency  asymptotic  modal  solutions 
when  the  inhomogeneous  refractive  index  profile  extends  to  infinity.1  By  comparison 
with  the  exact  solutions  developed  below,  the  evanescent  wave  tracking  method  is  being 
extended  to  account  for  truncated  profiles  within  a homogeneous  surround. 

The  truncated  parabolic  profile  has  been  treated  previously,  but  by  a procedure 
less  suited  to  our  purpose  than  the  one  presented  here.*1  Our  solutions  for  the  truncated 
hyperbolic  tangent  profile  are  new. 

B.  General  Formulation 

We  seek  solutions  u(x,  z)  of  the  scalar  wave  equation 

[V2  + k2n2(x)]  u(x,  z)  = 0,  3 / 9 y E 0 (1) 

where  k is  the  free  space  wavenumber  and  n(x)  is  a real  refractive  index.  An  exp(-iojt) 
time  dependence  is  suppressed.  These  source-free  (modal)  fields  should  be  bounded  in 
the  entire  interval  -»  < x < »,  and  therefore  decay  for  large  x,  and  they  propagate 
along  the  z-direction.  The  guiding  medium  consists  of  an  inhomogeneous  but  symmetric 
inner  (core)  region  wherein  n(x)  behaves  parabolically  near  the  x = 0 plane;  the  core  is 
surrounded  by  an  infinite  homogeneous  exterior  medium  (see  Figure  1).  At  the  bound- 
aries | x | = x^  between  these  regions,  the  refractive  index  may  have  a finite  discontinuity. 
Thus  n2(x)  is  given  by 


n^(x)  = 


(2) 


Fig.  1.  Various  profile  shapes,  (a)  General  symmetrical  profile; 

(b)  parabolic  profile.  unbounded  profile,  bounded 

profile  with  (i)  continuous  n£  = n(xc);  (ii)  n£  > n(xc);  (iii) 

n-  < n(x  ). 

2 ' c' 

where  E n(0),  and  n(x)  is  a function  description  of  the  profile.  For  the  special  case 
of  a continuous  profile,  one  has  n^  = n(xc). 

We  shall  consider  first  the  solution  for  an  infinite  core  wherein  x — • °°.  We  seek 

c 

to  determine  the  discrete  set  of  modal  propagation  coefficients  (3^  , which  will  be  found 
to  depend  on  the  integer  I that  also  serves  to  distinguish  various  modes.  When  x^  is 
finite,  the  presence  of  the  exterior  region  modifies  the  propagation  coefficients  in  that 
the  integer  t for  the  unbounded  core  is  replaced  by  a non-integral  index  v: 

v = I + (3) 

Accordingly,  the  propagation  coefficients  p^  undergo  the  transformation  — The 
dispersion  equation  for  Av^  is  derived  by  imposing  the  boundary  conditions  at  |x|  =*c. 
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We  shall  also  require  a consistent  procedure  for  uniquely  normalizing  the  eigen- 
functions.  The  construction  of  even  and  odd  solutions,  as  employed  by  Hashimoto,  is 
found  to  be  helpful.  However,  we  define  these  solutions  in  terms  of  known  functions, 
which  have  series  expansions  valid  in  all  regions  of  x,  rather  than  in  terms  of  the 
integral  representations  used  previously.^  The  series  expansion  forms  are  found  to  have 
advantages  of  analytic  simplicity  and  computational  convenience. 

In  accord  with  the  preceding  remarks,  the  wave  function  u(x,  z),  to  be  identified 
with  the  transverse  electric  field  E^,  is  expressed  as 

uv(x,z)  = fv(x)  exp(i|3^z)  (4) 


where  p^  is  the  propagation  coefficient  in  the  z direction.  Substitution  into  the  wave 
equation  (1)  yields  a second  order  ordinary  differential  equation  for  f (x): 


A * k2n2<x)  - 
_dx 


fv(x)  = 0 


(5) 


which  has  the  general  solution 

f (x)  = A <}>  (x)  + D 4i  (x) 
v v v v^v 


(6) 


where  A^  ar.d  are  constants  independent  of  x,  and  ^(x)  and  (x)  are  linearly  inde- 
pendent solutions  of  the  differential  equation  (5),  which  shall  be  chosen  as  decaying  and 
growing  solutions,  respectively,  as  |x|  — ►». 


1.  Unbounded  Core 

The  refractive  index  profile  for  the  unbounded  medium  is  of  the  form 
2 2 —2 

n (x)  = n - n (x)  -°°  < x < oo  (7) 

For  guided  modes,  which  decay  at  |x|  — • »,  it  is  necessary  to  eliminate  the 
growth  function  +„(*),  whence  E 0 in  Equation  (6).  Furthermore,  it  is  assumed  that 
the  function  <j>v(x)  is  now  characterized  by  integral  values  of  the  parameter  v so  that 
v-l  =0,1,  2,  •••.  Thus,  the  modal  field  solutions  for  the  unbounded  profile  becomes 

f„(x)  = f^x)  = Aj  ^(x)  , 

with  discrete  eigenvalues  p^  . 


(8) 
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2.  Bounded  Core 

For  the  bounded  core  profile  in  Eq.  (2),  the  assumed  solution  is  of  the  form 


f (x)  exp(iB  z) 

u (x,  z)  = 

fv(xc)exp(-av|x-xc  | )exp(ipyz) 


I x I < *c 


X > X 


.vhere  a = f — u (x, z)/u  (x,  z)l  + is  the  transverse  wavenumber  in  the  exterior  re- 
v 1 8x  v v ,Jx=x(T 

gion, 

«v  = [ pj  - n2(xc+)]l/2  (10) 

and  x*  is  on  the  exterior  side  of  the  boundary  (x*  = x^  + 6,  6 > 0,  6 — 0).  A choice  of 
positive  ny  assures  decay  as  | x | — ■ <». 

The  solution  in  the  core  region  is  the  same  as  in  Equation  (6).  However,  now 
D / 0,  and  v is  not  required  to  be  an  integer  but  is  determined  from  the  boundary  con- 
ditions at  | x | = xc:  continuity  of  the  tangential  components  of  electric  field  (uy)  and 
magnetic  field  (oc  9u  /9x).  Continuity  of  uy  is  already  satisfied  by  the  assumed  form 
in  Equation  (9).  Continuity  of  9uy/9x  leads  to  the  dispersion  equation 

f'{x  ) + a f (x  ) = 0 (11) 

V C V V c 

where  the  prime  denotes  differentiation  with  respect  to  x.  Here,  fy(x)  implies  that  in 
the  expression  for  <t>^  (x),  4>|(x)  and  P^  for  the  unbounded  core,  the  integer  l is  replaced 
by  v.  The  previously  discarded  growing  solution  4^  (x)  must  now  be  included  in  the 
analysis . 

To  simplify  solution  of  the  dispersion  equation,  and  to  exhibit  the  effect  of  trunca- 
tion of  the  core  medium  as  a perturbation  about  the  unbounded  case,  the  normalization 
coefficients  Ay  and  Dy  in  Eq.  (6)  should  be  chosen  appropriately.  Since  the  refractive 

index  profile  considered  here  is  symmetric  with  respect  to  x,  the  modal  fields  separate 

2 

into  two  sets  having  even  and  odd  x-symmetry,  respectively.  Following  Hashimoto, 
we  shall  normalize  the  functions  4>v(x)  and  4'v(x)  in  such  a manner  that  fv(x)  can  be 
expressed  as 


f°(x)  = 
v 


IT  V 
COS  -7- 

I 

TT  V I 

sin  — 


<t>v(x)  + 


TT  V 

" sln  ~2~ 

IT  V 
COS  — 


where  superscripts  e and  o designate  even  and  odd  modes,  respectively.  When  v takes 
on  even  or  odd  integer  values  i , this  solution  reduces  to  that  in  Eq.  (8)  for  the  unbound- 
ed profile.  Explicit  retention  of  the  decaying  function  <j>y  and  the  growing  function  in 
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the  even  or  odd  modal  field  solutions  is  useful  not  only  for  describing  the  perturbing 
effect  of  the  exterior  medium  but  also  for  the  subsequent  development  of  an  asymptotic 
theory  for  determination  of  f^.  For  the  special  profiles  discussed  below,  the  known 
even  and  odd  solutions  of  Eq.  (5)  can  be  manipulated  into  the  canonical  form  Eq.  (12) 
by  use  of  relevant  circuital  relations,  as  will  be  demonstrated.  For  the  parabolic  pro- 
file, Hashimoto2  obtained  the  split-up  in  Eq.  (12)  by  a more  complicated  manipulation 
of  contour  integral  representations. 

Using  Eq.  (12),  one  obtains  from  Eqs.  (11)  and  (3)  for  mode  fields  with  even  or 
odd  x- symmetry,  for  any  integral  t: 

■n  Av , 6'  (x  ) + a ()>  (x  ) 

l vvv  c'  vTv  c'  (13) 

tan  2 " (x  ) + a (x  ) 

For  modes  with  Avf  « 1,  this  transcendental  equation  can  be  solved  approximately  by 
assuming  that  $ (x)  « <(>|  (x)  and  +v(x)  = 4.|  (x),  and  by  replacing  the  tangent  by  its  argu- 
ment: 


Av 


_2  4>;(*c) + W 

" + } (xc)  + (xc) 


(14) 


This  approximation  is  generally  applicable  to  lower  order  modes  which  are  minimally 
affected  by  the  exterior  medium.  It  also  can  be  employed  as  the  first  iteration  in  a 
numerical  scheme  for  determining  improved  values  of  Av^  from  Equation  (13). 


C.  Parabolic  Profile 

The  two  dimensional  parabolic  profile  is  defined  as  (see  Fig.  1) 


n2(x)  = 


0 

2 


2 2 

aQx 


1*1  <Xr 


X > x_ 


(15) 


where  aQ  and  n£  are  constants  and  nQ  = n(0).  Thus,  Eq.  (6)  becomes  for  the  uncladded 

medium  (x  — 00 ): 
c 


2 q 


g| (h)  = 0 


(16) 


where 

gf  (q)  = (h)  exp(q2/2) 


(17) 


q = X , q c = xc 


(18) 


r 
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with  the  solution  (bounded  at  |x|  — 00 ) 


g,  (q)  = Hf (n) 


(19) 


P|  = n0k 


l - 


a0(2|  + 1) 


kn. 


1/2 


(20) 


Here,  (t|  ) is  the  Hermite  polynomial  of  the  l- th  order,  and  1 =0,1,2,  •••  is  an 
integer  that  can  be  interpreted  as  the  guided  mode  index. 


To  deal  with  the  cladded  profile  Eq.  (15),  we  change  I into  v =i  + Av^  in  Equation 


(20).  The  modified  equation  (16)  now  has  the  two  linearly  independent  solutions 
8v(ri)  = Hv(h)  and  gv(q)  = hv(h)  = exp(q2)  H_v  l(iri),  which  are  the  Hermite  functions 


V v V 

of  the  first  and  second  kind,  respectively.  These  functions  can  be  represented  as  fol- 
lows^: ^ 


Hv(n)  = TT-1/2  [cos  ^ nj  +i)1F1(-|.  i n2)  + 2q  sinYrd  + 'IW^-^l.h2)] 

(2 

hv(n)  = «-l/Z  [-sin^ni  +i)1F1(-|,j,r12)  + 2q  cos 


(21a) 


(21b) 


where  ^F^  is  the  confluent  hypergeometric  function  defined  as 


(c) 


and  T(q)  is  the  gamma  function  with  argument  q.  The  Wronskian  of  the  Hermite  function 

. 4 t 
is  1 


Wti[Hv(t1),  hv(n)]  = 1r'l/22l'Vr(l  + v)exp(n2) 


(22) 


One  may  now  define  g^e^(r|)  and  g|^  ^(q)  as  even  and  odd  functions  with  respect  to 


n: 


(e).  , = -1/2  1 v . _ , v _1  2. 

g (q ) — if  r(,  + 9),F1(-7,  7,  q ) 


(o),  . - -1/2  — .,  v.  — . 1 v 3 2. 

gv  (q)=  ff  r(! + 2)ti1F1(  2'  2'2,n  > 


(23a) 

(23b) 


ty 

For  convenience,  we  have  multiplied  the  functions  in  Ref.  4 by  2 to  obtain  the 
expressions  in  Equation  (21). 


OPTICS 


129 


whence  from  Eqs.  (21a)  and  (21b): 

r,  / v ir  v (e).  . . . ttv  (o).  . 

Hv(n)  = cos  gy  (n)  + sm  — '(q) 

...  . 7T  v (e),  . ir  v (o),  . 

hv(n)  = -sm  — gj,  (o)  + cos  — g'y  '(h) 


Conversely, 


g,,  '(h)  = cos  — Hv(t!)  - sin  — h„(r|) 


2“  \ 
V 


g(v0\n)  = sin  ^ Hv(r1)  + cos  ^ h^p) 

The  modal  fields  in  the  bounded  core  region  are  therefore  given  by: 


(24a) 

(24b) 

(25a) 

(25b) 


f(vo)(n)  = exP(-  -^r 


TT  V 
2 
7 T V 
2 


Hv<n>  + 


TT  V 
2 

TT  V 
2 


V’l) 


(26) 


where  the  upper  and  lower  symbols  refer  to  solutions  with  even  and  odd  x- symmetry, 

2 

respectively.  Hashimoto  obtained  the  same  relation  (with  slightly  different  normaliza- 
tion of  h^(r)))  using  a more  complicated  procedure  based  on  contour  deformation  in 
integral  representations  of  the  solutions  of  the  differential  equation  (16)  (with  i -*•  v). 

Equation  (26)  is  in  the  canonical  form  Eq.  (p2)  whence  one  can  identify  the  decay 
■and  growth  functions  as 


<j>v(n)  = exp(-  -^-)Hv(t1) 

(27) 

2 

4*v(h)  = exp(-  -^-)hv(ii) 

(28) 

Moreover,  from  Eq.  (20)  (with  t — • v)  and  Eq.  (10), 

2 2.  2 , , , i,2  2 

c*v  = n^k  - ka^(2v  + 1)  - k n^ 
or  for  the  special  case  n^  = n(xc)  (see  Fig.  1), 


a 

v 


ka.x 
0 c 


1 - 


2v  + 1 

. 2 
kanx 
0 c 


1/2 


(29) 


(30) 


The  modal  eigenvalues  are  then  obtained  from  Eq.  (13),  using  therein  the  functions 
identified  in  Equations  (27)  to  (30). 


■ 


1 
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For  comparison  with  results  derived  by  the  high-frequency  asymptotic  method  to 
be  treated  elsewhere,  we  list  here  the  asymptotic  behavior  of  the  solutions.  From  Eq. 
(18),  the  high  frequency  limit  (ka^  large)  corresponds  to  q » 1 provided  that  x ^ 0. 
Alternatively,  r)  » 1 can  be  achieved  for  moderate  ka^  but  sufficiently  large  x.  These 
restrictions  are  necessary  to  validate  the  asymptotic  approximation.  The  same  re- 
marks apply  to  qQ. 

The  asymptotic  expansions  for  the  Hermite  functions  for  q » 1 are^ 


2 °°  (-l)m(-v),  , 

(q)  qV  exp(-  -^-)  Yj  ~ 21  (2n)~  m 


m=o 


m: 


-l/2  v . . -v-1  , n2  . V *v  + ^2m  ,,  ,-2m 

tt  2 T(l  + v)q  exp(-^-)  ^ (2t1) 

m=o 


(31a) 

(32) 


One  may  expand  p^  from  Eq.  (20),  with  t — v,  in  inverse  powers  of  k: 


P ~ n.k 
v 0 


a (2v+l)  a2(2v  + l)2 

1 *“2 42  ^Qt-S) 

2 n„k  SnTk^  kJ 


(33a) 


but  we  observe  that  p^  is  given  exactly  by 


n2  2.  2 
P = n.k 
v 0 


1 - 


a0(2v  +1) 


kn 


0 


(33b) 


For  perturbed  parabolic  profiles,  it  is  therefore  preferable  to  obtain  the  asymptotic 

2 2 1/2 

expansion  for  p , and  to  define  p as  (p  ) ' . 
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Fig.  2(a).  Unbounded  hyperbolic  tangent  profile. 

(i)  hyperbolic  secant  profile  aQ  = nQ 

(ii)  a0  < n0 

(iii)  a0  > n0 


_IC  *C 


Fig.  2(b).  Bounded  hyperbolic  tangent  profile  with  aQ  < nQ. 

(i)  n2  = n(xc) 

(ii)  n2  > n(xc) 

(iii)  n2  < n(xc) 
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n = -v*  -v+1*  *Y  + n (39) 

where  n is  the  largest  integer  less  than  y?  This  restriction  can  be  written  as 

fi  + y - i i t - 0,1,  2,  • • • , n , < 0 (40) 

and  one  can  identify  the  integer  / as  the  mode  number.  Therefore,  the  solution  for  the 
uncladded  profile  is,  to  within  a multiplicative  constant, 

fj  (x)  oc  P^(tanh  bx)  , fi  + y = I (41) 

while  from  Eq.  (37), 


2 2 2 
f^(fx)  = b V 


. x , 2 2,.  k ,2 

1 + (nQ  - a J(  — ) 


Since  from  Eq.  (37), 


Y = 

and  from  Eq.  (40) 


ka  r 1 

_i+l!o  ! + (-b_)2 

2 + b 1 ' 2kaQ  ’ 


(42) 


(43) 


, lx1  ^0  fl  X I b 

H - t - y - l + 2 - b j^+(  2ka^  ) J 

one  may  express  as  a function  of  I : 


1/2 


(44) 


Pj  = k2l"n  - ^(2i  + 1) 


1 + (2fc)2 


1/2 


+ (£)2(f2  + l + 7) 


(45) 


For  the  cladded  profile,  < is  replaced  by  v in  Eq.  (44)  and  Eq.  (45)  and  the  solu- 
tion fv(t,)  of  Eq.  (38)  is  written  as  a linear  combination  of  P^(I,)  and  Q^(£),  with 

fl  + Y = v (46) 

The  Legendre  functions  can  be  represented  in  terms  of  the  hypergeometric  function 
8 t 

in  the  form  ' 


t 


In  order  to  match  the  solution  of  Eq.  (47)  to  the  solution  Eq.  (21)  for  the  parabolic 


profile  at  x = 0,  the  representations  for  P^(£)  and  Q^(£)  in  Ref.  8 are  renormalized  by 

Y Y 11  2 

multiplying  by  c j and  c^  respectively,  where  c^  = 2"Mr(l  + -j  - fi)  and  c^  = — c^  . This 
yields  Equation  (47).  There  is  a misprint  in  Ref.  8:  the  definition  of  Q^(£)  there  should 
be  corrected  by  multiplying  the  first  term  by  (-x). 


For  notational  simplicity,  we  retain  the  parameters  p,  v and  y although  p can  be  expres- 
sed in  terms  of  v via  Eq.  (44),  with  i — v,  and  y = v - fi. 


Solutions  and  ) with  even  and  odd  £- symmetry  can  now  be  defined  as 

(l-t2)'l‘/2[r(-J--j)eo»  Y]  'f,  = »'1/2r(-J+'|)(l-l2)‘,‘/2F1  (49a) 

,«°>a) , -247 (i-t2)'l‘/2r(i+|- n)  [.i»  Yr«I+l’  ^ I»]  ‘i  f2 


I 

= 2W'l/2ra+7)[r(i+-|-fx)/r(-|  + -|-ii)](i-;2r^/2tF2  wh) 

whence  from  Eqs.  (47a)  and  (47b) 

P^(t.)  = COS  Y f(ve)(^)  + sin  ¥ f(v°)a)  (50a> 

Q^(U  = -Sin  Y f(ve)(^)  + cos  ¥ f<v°)(;)  (50b) 

Conversely, 

f(ve)(t, ) = cos  Y ) ' sin  Y ) <51  a) 

= Bin  Y + cos  ¥ Q^(^)  (51b) 

which  can  be  combined  into  the  canonical  form  Eq.  (12): 


which  can  be  combined  into  the  canonical  form  Eq.  (12): 
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(g) 

C (U 


I 

cos  — r— 

1 -sin“T  i 

= 1 

l 

1 TTV  I 

' sin  “y“  I 

| ^ + 

/ 7T  V 

« cos  — =-  J 

| Qv(U 

(52) 

Thus,  referring  to  Eq.  (12),  one  identifies  the  decaying  and  growing  solutions  as 

<j>v(x)  = P^(tanh  bx)  (53a) 

v(x)  = Q(J(tanh  bx)  (53b) 

The  modal  eigenvalues  are  inferred  from  Eqs.  (13)  and  (53),  where  from  Eq.  (10) 
and  from  Eq.  (45),  with  I — v. 


a - k 
v 


) 2 2 ba0  ,,  , ..  f,  ^ , b ,2 

Jn0  ' n2*  k (2v  + l)  1 + ( 2kaQ  * 

. 


1/2 


. b .2  . 2 , 1, 

+ (£>  (v  + v + ^ ) 


1/2 


(54a) 


For  the  special  case  n^  = n(xc) 


a - ka_  £ < 1 
v O^c 


kan^ 


0bc 


1/2 


. . b .2.2.  2 . 1. 

+ ( 2)  t (v  + v + rr) 

ka_£ 

0”C 


1/2 


(54b) 


The  representations  Eq.  (47)  for  the  Legendre  functions  are  not  appropriate  for 
the  high  frequency  range  because  the  parameter  fj,  which  is  proportional  to  k (see  Eq. 
(44)),  appears  in  the  numerator  of  the  series  expansion  Eq.  (48)  for  F^  and  F^ . Instead, 
another  representation,  which  has  p in  the  denominator  of  the  series  expansion,  has 
been  selected  from  the  variety  listed  in  Ref.  9 (this  representation  is  normalized  in 
accordance  with  the  footnote  on  page  132); 

P£ft>=  [r(l+|- Ji)r(l- |i)]  tVd-tV,l/i2F1(-|,^-|,l-M.l-t-2)  (55a) 

Q*J(t)  = 2'vir*l/zr<i  + v)[r(i+|-fi)/r(|  + v-fi)]t'v'1(i-  zf/z 

2Fl(1  + 2’2  +7*i+ (55b) 

2 

The  corresponding  asymptotic  expansion  for  p^  is  from  Eq.  (45),  with  I v: 


< ■ »2[»o2  • * .)  T1  * £ <v2  * v ♦ *,  - Ugg  * o,k-5)] 


(56) 


For  special  case  a^  = n^,  the  hyperbolic  tangent  profile  becomes  the  hyperbolic 


secant 
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n 


sech  (bx) 


(57) 


which  has  the  important  property  that  all  of  its  guided  modes  travel  with  the  same  gr  :>up 


velocity,  v^.  From  Eq.  (42),  p^  = bp  or 


fi  = bv  + 7 - kn. 


1 + (~r— ) 
ckn0 


(58) 


d3 


whence,  the  group  velocity,  defined  by  v^  = — , is  independent  of  v. 


The  modal  solutions  for  the  hyperbolic  secant  profile  can  be  determined  from 
Eqs.  (55),  (54),  (58)  by  substituting  a-  = n^. 
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GUIDED  MODES  IN  A GRADED  INDEX  OPTICAL  FIBER  SURROUNDED  BY  A HOMO- 
GENEOUS CLADDING 

E.  Navon  and  L.  B.  Felsen 

A.  Introduction 

Modal  field  solutions  for  graded  index  optical  fibers  surrounded  by  a homogeneous 
cladding  must  generally  be  obtained  by  approximate  methods  suitable  for  the  high  fre- 
quency range.  To  check  the  validity  of  these  methods,  it  is  important  to  have  special 
exact  solutions  available  for  comparison.  Here,  such  a solution  is  developed  for  fibers 
with  a parabolic  index  core.  The  formulation,  carried  out  in  a manner  that  can  be  adapt- 
ed to  the  construction  of  asymptotic  solutions  by  the  evanescent  wave  tracking  method, 
is  analogous  to  that  described  for  slab  waveguides  elsewhere  in  this  report.* 

B.  General  Formulation 

We  seek  angularly  periodic,  bounded  solutions  E(r,0,z)  and  H(r,  0, z)  of  the  vector 
wave  equation  in  cylindrical  coordinates.  The  refractive  index  n(r)  is  assumed  to  be 
longitudinally  and  angularly  independent.  It  may  describe  an  inhomogeneous  core  of 
radius  surrounded  by  an  infinite  homogeneous  cladding  (see  Figure  1) 


(•) 


<»> 


Fig.  1.  Various  profile  shapes,  (a)  General  symmetrical  profile,  (b)  parabolic  pro- 
file.   unbounded  profile  bounded  profile  with  (i)  continuous  n n(rc); 

(ii)  nz  > n(rc);  (iii)  n^  < n(rc). 

where  n^  = n(0),  n(r)  is  a given  analytic  function  of  r,  and  n^  is  a constant. 

When  solving  the  three-dimensional  vector  wave  equation  in  cylindrical  coordinates 
by  the  method  of  separation  of  variables,  one  gets  two  separation  parameters  and  p 
for  the  angular  and  radial  domains  respectively.  The  periodic  angular  dependence  is  of 
the  form  exp(inO),  where  n takes  on  integral  values,  which  may  be  taken  as  positive  or 
zero,  without  loss  of  generality:  n-0,1,  l,  •••  . The  radial  parameter  p for  modal  fields 
bounded  at  infinity  in  an  uncladded  profile  can  be  identified  with  the  integers  p m = 0,  I, 

^ One  must  distinguish  between  the  variable  refractive  index  n(r)  and  the  angular 
parameter  n. 
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l,  • • • . In  the  core  region  r£r^  of  a cladded  profile,  boundedness  at  infinity  is  not  re- 
quired whence  p can  assume  non-integer  real  values: 

p = m ♦ Ap  (2) 


I'he  cladding  effect  is  thus  expressed  via  Au^  which  is  determined  from  the  disper- 
sion equation. 

2 3 4 

Kurt/,  and  Streifer  , and  later  Hashimoto  and  Ikuno  found  that  for  a weakly 

inhomogeneous  medium  (n(r)<  n()  for  r < r ) the  vector  wave  equation  can  be  re- 

l .i  ed  ti  i sei  ond  order,  ordinary  differential  equation  for  a scaler  wave  function 

R 1 (r  1 of  the  form 

u.  q 


R ( r ) = 0 

u,  q 


with 


q * | I ♦ n| 


i : I.  2 


(3) 


(4) 


vhere  iu  exp  in**  ♦ i,  zt  dependent e is  assumed  and  the  upper  (lower)  sign  cor- 
respond  to  i a ! i * 2).  The  solution  of  (3)  can  be  expressed  as 


R 1 trl  » A1  d> 1 (r)  v D1  k 1 (r)  l«  1,2 

u,  q u.  q u,  q U,  q U.  q 


(5) 


i • ere,  vithout  loss  of  generality,  the  linearly  independent  functions  r ) and  i/(r) 
ire  iss  ,med  to  be  de>  aying  and  growing,  respectively,  as  r * oo  . 

1.  L’nt  ladded  Profile 
1 he  uni  ladded  profile  is  of  the  form 


2,  . 2 -2.  . 

n (r)  * nQ  - n (r) 


0 < r < ae 


(6) 


f r w 'm  h the  inhomogeneous  core  extends  to  infinity  and  the  solution  (5)  should  hold 

f.  r U r / . Boundedness  at  r • / excludes  i (r)  from  the  solution.  More- 

over,  it  is  is -umed  that  the  function  (r)  is  now  described  by  integral  values  of 

ut  q 

u - m * 0,  1 , 2,  . . . so  that  the  r dependent  part  of  the  solution  for  the  uncladded  prow 
file  is 


R1  I r ( s R ' ( r ) * A <f>'  (r)  i = 1,2 

u,  q m,  q m,  qvm,  q 


2.  Cladded  Profile 


(7) 


For  the  cladded  profile  Fq.  (1)  the  solution  in  the  core  region  is  Rl  (r)  (see 

Ut  H 


Fq.  (51),  and  in  the  outer  region  the  solution  is  assumed  in  the  form 


R1  (r  ) Tk  <o  r )/K  (o  r )1 
U.  q c L n u,  q n u,  q c J 


, r > rc  i = 1.2 


(«> 
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where  K (w)  is  the  modified  Bessel  function  of  n-th  order  and  a is  the  trans- 
n U,  q 

verse  propagation  coefficient  defined  as 

r i !/2 


a 


u,  q 


(9) 


At  the  boundary  r « rc  the  required  continuity  of  the  tangential  electric  field  is  al- 
ready satisfied  by  the  assumed  form  of  Equation  (8).  Continuity  of  the  tangential 
magnetic  field  requires 


^R1  (r  ) + o R 1 (r  ) = 0 i = 1.  2 

or  U,  q c u,  q u,  q'  c'  *•  * 

where  for  the  special  case  n.j  = n(r  ) it  can  be  shown  that 


1 + n 


9 

o"  r 


U,  q 


K’n(w) 

■■  K (w) 
n 

w=  r a 
c 

i = 1.  2 


(10) 


(11) 


and  the  prime  denotes  differentiation  with  respect  to  the  argument.  Normalizations 

will  be  sought  such  that  the  r -dependent  part  of  the  solution  Rl  (r)  can  be  cast  into 

U»  9 

the  canonical  form 


Rl  (r)  = cos  n u ft1  (r) 

q U,  q 


sin  nun  ( r ) 


i = 1,  2 


(12) 


which,  u * m,  reduces  to  the  uncladded  profile  solution  Equation  (7).  For  the  para- 
bolic refractive  index,  the  form  in  Eq.  ( 1 2 ) was  also  obtained  by  Hashimoto^  via 
manipulation  of  contour  integral  representations  of  the  solution  for  the  unbounded 
profile.  We  shall  use  an  alternative  procedure  based  directly  on  known  properties 
of  the  confluent  hypergeometric  functions. 


F rom  Eq.  (12)  and  the  dispersion  equation  (10),  one  obtains  the  transcendental 

equation  for  Su  in  Eq.  (2), 
m 

<b  (r  ) + o<f)(r  ) 

tan  r u - tan(  rAu^)  = — , — — i = 1,2  (13) 

<J  (rc  ) + »</  (r^) 


where,  as  a matter  of  convenience,  we  omitted  the  subscripts  u and  q;  the  prime 
denotes  differentiation  with  respect  to  r and  o and  is  defined  in  Equation  (11). 
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C.  Parabolic  Profile 

The  parabolic  profile  is  defined  as 


n“(r)  = { 


2 a 2 2 
0 " 0 


r < r 

— c 


r > r 


(14) 


2 c 

The  solution  for  the  uncladded  case  r — oo  is  nown  to  be2 


Rm,  q = exp(-ka0r2/2)  T^,(kanr2)  i * 1.  2 


-i  i iva.-.  i 

m ' 0 


also 


„2  ,2  2 
p = k n„ 
m,  q 0 


1 - 


2aQ(2m+q+l ) 


km 


i = 1,  2 


(15) 


(16) 


where  q and  m are  the  modified  angular  (see  Eq.  (4))  and  radial  mode  indices, 
respectively,  and  ^ is  the  generalized  Laguerre  polynomial. 

For  the  core  region  of  the  cladded  profile,  m is  replaced  by  u as  in  Eq.  (2), 
and  the  Laguerre  function  is  expressed  via  the  confluent  hyper  geometric  func- 

tion , F.  : J 


1 r 


hulr2>-  TTTwftWi 


where 


(.2  , 2 

£ = kaQr  , 


.2  , 2 
F = kanr 
c 0 c 


(17) 


(17a) 


Decomposition  of  the  solution  into  the  decaying  and  growing  functions  of  the  canonical 
form  Eq.  (12)  can  be  carried  out  by  use  of  the  connection  formula  between  the  first 
( 1 ^ 1 ) and  second  (U ) confluent  hypergeometric  functions^’ 


lFl(~u‘  1+9«  p2>  = ru'+q+u)  cos  ,uU('u>  1+9* 


+ exp(  + £2)  Ud+U  + q.  l+q, 


(1  9a) 


where 


'I 
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U(a'c'w,=  1ufcl  [r<c)r<l+a-o] 


- w1  -cjY(a)  r(2 

which  with  Eq.  (17)  yields 


-'•]  '*  ,r. 


(a+l-c,  2-c,  w) 


L<q)(P“)  = cos  vu 


jr(l+u)J  'ui-ttltq,?3 


- sin(w)7r-1  r(l+u  + q)exp(p2)  U(l+u+q,  1 +q,  -F2) 


Comparison  with  Eq.  (12)  gives 


q(r)  =rC3exP  (-ka0r2/2)|^r(l+ii) j U(-m,  1+q,  kaQr2) 

H>lUt  q = rqexp(ka0r2/2)r(l+U+q)  tt'1  U(l+w+q,l+q,  -kaQr" 
The  propagation  coefficients  are: 


0p,q=k2nO  1 |(2m+q+l+2Au  ) (21 

kn0 

where  Aum  is  to  be  determined  from  Eq.  (13),  using  the  functions  in  Equation  (20). 

The  asymptotic  expansion  of  the  confluent  hypergeometric  function7  for 
w — 30  , - ~ < arg  w < — is 


U (a,  c,  w)  = w 


a i (a). (a-c  + 1 ). 

I (-l)j  — 1 t 1 + 0(|w|"a"1_J).  j = 0,  1,  2, 

ilwj 

j=o  J 


Thus,  for  p2  = ka^r 2 - oo 


■ (-w).(-p-q). 


q<f,=  exp(-F2/2)P2u+q|Y(l+«)J  X (-l)j  -^4^ 


<4  qU)=  exp(F2/2 )p-2w-q-2  r(l+H+q)  1 


For  the  Bessel  functions,  one  has^ 


(l+U+q).(l+U) 

I J H f 

3=o 
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-wK  (w)/K  (w)  = w 
n n 


1 . 4n  -1  , -3 , 

1 + 2^  + + °(w  1 


(24) 


It  is  convenient  to  introduce  a * combined  mode  parameter"  v instead  of  u 
v a 2u  + q (25) 

With  this  change  the  asymptotic  expressions  (23)  become 

1 "1  r 


<t>,  n(?)  = exp(-?2/2)?1' 
V » H 


r(^) 


i - 


V±£ 

4?2 


(v2-q2)  [(l/-2)2-q2] 


32? 


+ 0(?  ) 


(26a) 


(?)  = exp(P2/2)?  ~V  ~2  ff_1  T(^— ) fl  + 
v,  <1  2 1 4? 

[(i'+2)2-q2]  [(i/  + 4)2  - q2  ] _6| 

+ 4 + 0(?  6) 

32?  J 


(26b) 


Also 


a2  x2  2 

P „ = k nn 
v,  q o 


2ao 

1 - (l/+1+Al/m) 

kno 


Moreover,  A u 


Ai/  / 2 in  Equation  (1  3). 
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THE  DIFFRACTION  OF  GAUSSIAN  BEAMS  BY  PERIODIC  LAYERS 
R.S.  Chu,  J.A.  Kong  and  T.  Tamir 

A.  Background 

Because  of  its  relevance  to  holography,  to  the  diffraction  of  light  by  sound,  as 

well  as  to  other  applications,  the  scattering  of  a plane  wave  by  a periodically  modulated 

layer  has  been  treated  extensively  in  the  past  by  using  a variety  of  methods  } The  sim- 
2 

plest  approach  is  to  replace  the  layer  by  a half  space  wherein  the  field  is  expressed 
in  terms  of  two  first-order  coupled  waves;  however,  this  procedure  neglects  the  effects 
of  the  exit  boundary.  For  a layer  bounded  by  identical  media  on  both  sides,  the  con- 
figuration is  symmetric  and  its  relevant  electromagnetic  problem  has  been  solved  rig- 
orously by  Chu  and  Tamir.  The  asymmetric  configuration  of  a layer  bounded  by  dif- 
ferent media  on  its  two  sides  has  been  studied  with  a rigorous  modal  approach  by  Chu 
and  Kong.  ' 9 

If  the  incident  field  is  a realistically  bounded  beam,  the  diffracted  field  may  be 
quite  different  from  that  of  an  incident  plane  wave  of  infinite  extent.  For  a Gaussian 

beam  incident  at  or  close  to  the  Bragg  angle  on  a modulated  half  space.  Chu6  and  Chu 

7 8 

and  Tamir  ' have  shown  that  (a)  the  diffracted  beams  are  formed  by  depletion  of  ener- 
gy away  from  the  center  of  the  original  beam,  (b)  the  transmitted  beams  split  at  the 
exit  plane  of  the  film,  and  (c)  complete  conversion  of  energy  into  the  Bragg  field  cannot 

generally  occur.  Upon  examining  the  patterns  of  Gaussian  beams  diffracted  by  a thick 

9 

holographic  film,  Forshaw  verified  items  (a)  and  (b)  and  he  observed  ripples  in  the 
fa r-  field  intensity  of  the  emerging  beams.  Forshaw  also  derived9  the  far-field  profiles 
of  these  beams  by  using  a Fourier  transform  of  Chu's  field  solution1'  at  the  exit  plane 
of  the  modulated  layer.  While  the  patterns  thus  obtained  by  Forshaw  provide  an  exten- 
sion of  the  half- space  results,  they  do  not  predict  the  observed  ripples  in  the  far  field 
pattern . 

In  the  present  work,  we  treat  the  diffraction  of  a Gaussian  beam  by  modulated 
layers  and  derive  solutions  for  both  the  near  and  the  far  fields  by  integrating  over  an 
appropriate  plane-wave  spectrum.  The  computed  results  agree  with  Forshaw's  experi- 
mental observations  and  they  account  for  the  ripples  in  the  diffracted  beam  profiles. 

We  also  confirm  Chu  and  Tamir 's  conclusions  that,  in  contrast  to  previous  theoretical 
studies  which  assumed  an  incident  plane  wave,  complete  conversion  of  energy  into  the 
Bragg  field  cannot  generally  occur  in  the  case  of  bounded  beams.  The  reason  for  this 
incomplete  conversion  is  clarified  by  examining  the  spectral  contents  of  the  beam  energy. 

B.  Solution  for  the  Far  Field 


To  consider  the  diffraction  of  bounded  beams,  we  first  list  succinctly  the  pertinent 
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expressions  for  a plane  wave  incident  on  a periodic  slab,  as  shown  in  Figure  1.  The 


aid  Dfnnu  MOOUIATID 

AIR  REGION  MfOiOM 

Vo  M [ 
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gxt 


DIFFRACTED  BEAM 


Fig.  1.  Geometrical  configuration  of  the  problem, 
permittivity  of  the  slab  is  assumed  to  be 

e(z)=CqE^[I+McosKz]  (1) 

where  K = 2ir/A,  fn  is  the  permittivity  of  vacuum,  t . is  the  relative  permittivity  of  the 
slab  if  M -0,  A is  the  periodicity,  and  M is  the  modulation  index  of  the  periodic  medi- 
um. A plane  wave  with  perpendicular  polar ization  and  characterized  by  an  electric 
field  vector 

ik , x + ik_  z 

E = E_  e x °Z  (2) 

y 0 

is  assumed  incident  on  the  slab  from  z < 0,  at  an  angle  G with  respect  to  the  normal  to 
the  slab.  The  time  dependent  factor  e is  assumed  and  suppressed  in  Equation  (2). 
As  given,  the  incident  propagation  wave  vector  is  k = xkj^-f  zk^,  where  k^  = k ^ sin  0, 

k , = k,  cos  0,  and 

lxl 


i 2 TT  , . ,l/2 

ki  =—^1  = 

where  X is  the  wavelength  in  vacuum.  We  assume  that  the  incident  angle  0 is  at,  or 
near,  the  first  Bragg  angle  0^  so  that  the  transmitted  wave  consists  mostly  of  the  zero- 
th and  the  first  order  wave  components,  namely, 

b 

ik  , x + ik„  z ik , x + ik  , z 

E =T0e  3x  0z  fT.je  3x  ’lz  (3) 


where 
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3x 
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(4) 

(5) 

(6) 


The  transmission  coefficients  Tg  and  T ^ can  be  found  by  using 


and  k3  = w(  £^Q) 

the  second-order  coupled  mode  equations  and  by  properly  matching  the  boundary  condi- 
tions at  x = 0 and  L,  as  given  in  Reference  5.  Hence  Tg  and  T ^ are  well-known  for 
plane  waves . 


While  this  work  has  examined  the  transmitted  field  of  an  incident  Gaussian  beam 
in  both  the  near  and  the  far  field,  we  present  here  only  the  latter  field  in  order  to  com- 
pare it  with  Forshaw's  observation.  For  this  purpose,  we  use  the  Fraunhofer  approxi- 
mation 


i2-rr 


ik,x  + ik,z  /2x  °° 

3 3 ' r 


(x-  L) 


/ dz’V2,)e 


■ik^z  1 sin  0 


(?) 


where  now  E^  denotes  a field  produced  by  a bounded  beam  rather  than  that  due  to  an 

incident  wave,  while  E (z'l  is  the  beam  field  at  x = L.  Thus,  E is  the  Fourier  trans- 

ap'  y 

form  of  that  aperture  field.  For  incidence  near  the  Bragg  angle,  E (z1)  consists  of  the 

ap 

(zero-order)  refracted  field  and  the  (-1  order)  Bragg  field,  which  are  given  respective- 
ly by 


E°(z)  = /^G(k0z)T0(k0z)exp(ik0zz)dk0z  , 


(8) 


E-  1<Z>  = f C(kQz)  T_  l(k0z)  exp(ik0zz)  d kQz  , 


(9) 


where  T^(k_  ) and  T . (k^  ) are  known,  as  noted  before,  and 
0 Oz  -1'  Oz 


C(k0z)  =-f  exp[-(k0z-  k sin  eQ)2w2] 

Vir 


(10) 


is  the  transform  of  the  incident  Gaussian  field,  where  4Wg  is  the  beam  width  projected 
along  the  z axis  and  Gg  is  the  angle  of  incidence,  as  shown  in  Figure  1.  Thus,  for  in- 
cidence at  exactly  the  first  Bragg  angle,  0g  = 0g,  we  have 


kjSin0g  = 'rr/A=-|K  . 


(ID 
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By  introducing  Eqs.  (8)  through  (10)  into  Eq.  (7),  we  derived  the  field  patterns 
for  a large  number  of  typical  situations.  One  such  case  is  illustrated  in  Figs.  2 and  3, 


Fig.  2.  Far  field  pattern  of  the  zeroth  order  beam  for 
Dj  = 8 . 2475  mm,  X = 0.63  28  pm,  A = 1pm, 

£1  “ e3”  l’  € 2 ~ 2-25'  W0  = 500  and  M=  10'4- 


Fig.  3.  Far  field  pattern  of  the  Bragg- scattered  beam  for 
the  same  parameters  as  those  in  Figure  2. 


which  show  the  zeroth  order  beam  and  the  Bragg- scattered  beam,  respectively.  We 
compare  the  zeroth  order  beams  for  the  two  slab  thicknesses  L=  Dj/2  and  L,  = D^/4, 
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as  functions  of  A0  =0  - 0_  in  Figure  2.  Here  D./2  is  the  thickness  for  which  total  con- 
version  of  energy  into  the  Bragg  beam  would  be  expected  from  the  simple  plane-wave 
analysis.  In  contrast,  we  see  in  Fig.  2 that  only  a deep  null  is  produced  at  the  center 
position  0=0g=  18.445°,  but  otherwise  a two-lobe  zero-order  beam  of  substantial 
magnitude  does  appear  in  the  far  field.  This  agrees  with  the  analytic  and  experimental 
results  obtained  by  Forshaw.  The  physical  reason  for  this  deep  null  is  that,  at  L = Dj/2, 
the  central  portion  of  the  Gaussian  spectrum  of  the  zero-order  wave  has  completely 
converted  its  energy  into  the  Bragg- scatter ed  wave,  which  results  in  a depletion  of 
energy  from  its  beam-center  position,  as  shown.  For  the  case  L=  D^/4,  the  zeroth 
order  beam  has  only  a small  dip  at  0 =9g  because  only  partial  conversion  of  energy  is 
expected  for  this  slab  thickness. 

A comparison  of  the  Bragg- scattered  beams  for  the  two  slab  thicknesses  L = D^/2 
and  L = Dj/4  is  shown  in  Figure  3.  For  the  case  L = Dj/2,  the  beam  width  is  quite  nar- 
row and  side- lobe  ripples  are  obtained.  This  agrees  with  the  experimental  results  per- 

9 

formed  by  Forshaw,  even  though  his  analytical  results  have  revealed  no  ripples  for  the 
Bragg- scattered  beam.  For  the  case  L = Dj/4,  the  beam  width  becomes  broadened  and 
the  side-lobe  ripples  are  much  smaller. 

The  foregoing  considerations  and  results  can  be  shown'*'  to  be  consistent  with  the 
fact  that,  if 


wo>  w=  TmTT 


(12) 


then  most  of  the  incident- beam  energy  is  concentrated  around  the  Bragg  angle.  In  that 
case,  the  transmitted  field  behaves  very  much  like  that  of  a plane  wave.  For  the  case 
shown  in  Figs.  2 and  3,  condition  (12)  is  violated  so  that  total  conversion  of  energy  into 
the  Bragg  beam  does  not  occur. 

For  comparison,  we  show  in  Figs.  4 and  5 a situation  that  does  satisfy  w^  > W, 
in  which  case  nearly  total  conversion  is  obtained  if  L = D^/2,  as  expected.  Thus,  the 
case  of  Figs.  4 and  5 corresponds  well  with  the  plane-wave  analysis,  which  predicts  no 
transmitted  zero-order  beam  if  L = D^/2  and  transmitted  zero-order  and  Bragg  beams 
of  equal  magnitudes  if  L = Dj/4.  Furthermore,  ripples  occur  in  all  of  these  far  field 
patterns,  in  agreement  with  Forshaw's  observations. 

As  illustrated  by  the  above  examples,  we  find  that  our  results  provide  an  extension 
of  previous  theoretical  work  and  yield  analytical  expressions  consistent  with  the  most 
recent  experimental  evidence.  Further  details  on  these  aspects  have  been  presented 
in  a recent  article/^  which  examines  the  near  field  as  well  as  the  far  field  discussed 
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BRAGG- REFLECTION  APPROACH  FOR  BLAZED  DIELECTRIC  GRATINGS 
K.C.  Chang  and  T.  Tamir 

Optical  waves  supported  by  a dielectric  grating  in  integrated- optic s devices  are 
often  subject  to  considerable  losses  because  the  grating  scatters  energy  into  undesira- 
ble directions.  Thus,  in  a beam  coupler  as  shown  in  Fig.  1,  an  incident  surface  wave 


Fig.  1.  Conversion  of  a surface  wave  into  outgoing 
beams  in  a thin-film  grating  coupler. 

is  converted  by  the  grating  into  two  outgoing  beams,  one  of  which  radiates  into  the  up- 
per air  region  while  the  second  one  radiates  into  the  substrate  and  subsequently  into 
the  lower  air  region.  However,  usually  only  the  former  beam  is  utilized,  so  that  the 
energy  scattered  into  the  substrate  should  be  minimized.  This  can  be  achieved  by 
"blazing"  the  grating,  i.e.,  choosing  a suitable  asymmetric  profile  for  the  periodic 
layer,  as  already  verified  both  theoretically1'3  and  experimentally  4 

The  present  work  addresses  itself  to  the  choice  of  a suitable  design  for  achieving 
this  blazing  effect.  In  a previous  study,  Marcuse3  used  a geometric- optical  argument 
for  this  purpose,  but  he  pointed  out  that  his  approach  becomes  unwieldy  and  inappropri- 
ate for  deep  grating  grooves,  in  the  present  work,  we  use  a Bragg- reflection  analysis 
that  applies  to  arbitrary  grating  heights:  preliminary  results  indicate  that  this  analysis 
is  both  accurate  and  simple  to  use. 

The  principle  of  this  Bragg  approach  is  illustrated  in  Fig.  2,  which  describes  a 
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Fig.  2.  Extended  lattice  description  of  a dielectric  grating 
having  a general  trapezoidal  profile. 

rather  general  trapezoidal  grating  profile  shown  by  the  thick  solid  lines.  Consistent 
with  analytical  results  obtained  by  Tamir  and  Peng,^’^  the  dielectric  grating  exhibits 
the  following  properties: 


(1)  The  layer  of  height  tg  acts  as  a periodic  perturbation  of  the  (volume)  average 
dielectric  constant  Eg  inside  the  layer,  i.e.,  this  constant  is  given  here  by 

£g  =ta  + (do  + dl)(£f'  Ea)/2d  • (1) 


(2)  The  above  periodic  perturbation  scatters  an  incident  wave  into  directions 

determined  by  the  grating  facets,  which  are  characterized  by  normal  vectors 
Gj,  G2,  C3  and  G4  in  Figure  2. 

To  find  the  directions  of  constructive  scattering,  it  is  then  useful  to  extend  the 
periodic  grating  configuration  into  the  lattice  structure  indicated  by  the  dashed  lines. 

In  such  a lattice,  Bragg- scatter ing  theory  implies  that  strong  constructive  scattering 
will  occur  along  those  directions  that  satisfy  a Bragg  condition  with  respect  to  any  one 
of  the  following  vectors: 

Gj  = -(2mir/d)  (jCq  - cot  ) 


G2  = -(2nir/d)(x()  + cot  *2> 


C3  = -G4  = (2pn/tg)z{) 


(m,n,p  = ±1,  ±2,  ±3,  • • •)  , 


(2) 
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where  and  are  unit  vectors  along  the  respective  directions. 

To  simplify  matters,  we  shall  examine  only  the  possibility  of  first-order  Bragg 
interactions,  i.e.,  m - n - p = 1 . For  an  incident  plane  wave  given  by  the  vector  K.  in 
Fig.  2,  the  scattering  by  the  extended  grating  is  therefore  described  by  the  Ewald  dia- 
gram shown  in  Figure  3.  A Bragg  condition  for_G(i  = l,2,3,4)  is  then  satisfied  if  the 


^z 


(-*  - 2w/d 


I ic.  3.  Ewald  diagram  for  scattering  of  a plane  wave  incident 
within  the  grating  region  shown  in  Figure  2. 

vectorial  summation  of  jv^  and  Ch  yields  a vector  J<  whose  vertex  lies  on  the  circle  of 
radius  k k-  This  is  the  case  for  Ci^  in  Fig.  3,  where  strong  constructive 
interference  via  Bragg  reflection  occurs  into  the  upper  region  along  the  direction  given 
by  the  vector  iig  • While  this  arrangement  assures  Bragg  scattering  with  respect  to 
Gj,  leakage  along  j<g  will  be  further  enhanced  if  the  scattering  due  to  the  other  vectors 
C^,  C3  and  C4  is  appropriately  reduced.  The  effect  of  G3  and  can  be  easily  elimi- 
nated by  choosing  triangular  (rather  than  trapezoidal)  grating  profiles.  However,  the 
vector  may  then  still  account  for  appreciable  scattering  into  the  substrate  unless  it 
is  chosen  so  as  to  be  far  from  satisfying  a Bragg  condition. 

To  apply  the  above  principles  to  the  case  of  a beam  coupli  r,  consider  Fig.  4 in 
which  an  incident  surface  wave  is  desired  to  leak  most  of  its  energy  into  the  upper  (air) 
region.  The  appropriate  incident  wave  is  now-  shown  by  the  arrov.  denoted  by  (3  whose 
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Fig.  4.  Effect  of  varying  in  a triangular  grating.  In  this  case, 
Gq  has  already  been  adjusted  to  satisfy  a Bragg  condition 
for  scattering  an  incident  surface  wave  into  the  upper  (air) 
region. 


length  is  larger  than  k 0jr  Again,  we  choose  to  produce  Bragg  scattering  into  the 

upper  (air)  region  and  examine  possible  choices  for  C assuming  now  a triangular  grat- 

4 (4) 

ing  profile.  It  is  then  clear  that,  for  a symmetric  grating  indicated  by  G ^ , a nearly 

equal  amount  of  scattering  should  occur  into  the  air  and  substrate  regions,  because 
(4) 

both  CIj  and  (T,  ' satisfy  a Bragg  condition.  Other  typical  situations  for  G^  are  shown 

in  Fig.  4 by  Cj^  through  _C^.  Of  all  these  forms,  G^  =■  G^  would  be  optimal  because 

both  and  G^  would  then  satisfy  the  same  Bragg  condition  for  scattering  into  the  air 

region  only.  However,  Gy  — ■ Ci  requires  very  thick  gratings,  such  as  the  holographic 

4 1 7 

gratings  used  by  Kogelnik  and  Sosnowsky.  For  milled  or  etched  gratings,  on  the  other 

(2) 

hand,  the  situation  shown  by  Cl'  appears  to  be  most  realistic  and  this  corresponds  to 

4 (2) 

a right-angled  triangle.  In  such  a case,  G^  - G\,  ' ends  sufficiently  far  from  the  Bragg 


circle  so  that  energy  should  be  scattered  primarily  by  Clj  only 
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The  preceding  arguments  indicate  that  blazing  should  be  effectively  obtained  by  a 
grating  profile  having  a right-angled  triangular  shape  with  its  two  facets  satisfying  the 
construction  for  CIj  and  _G^  ^ shown  in  Figure  4.  It  is  then  easily  found  that  the  required 
relationship  for  that  desired  blazed  grating  is  given  by 


To  verify  this  simple  design  criterion,  we  have  calculated  the  efficiency  pa  of  scatter- 
ing into  the  air  region  for  the  grating  shown  in  Fig.  5,  where  tj  is  the  ratio  of  the 


Fig.  5.  Variation  of  efficiency  r)a  and  leakage  coupling  a\  in  a 
right-angle  triangular  dielectric  grating  on  which  a 
surface  wave  is  incident. 

energy  of  the  upper  air  beam  and  the  total  scattered  energy.  The  construction  pre- 
scribed by  Eq.  (3)  predicts  an  optimum  condition  at  t^/X.  = 0.  74,  as  indicated  by  the 
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a r row  inFigure  5.  Exact  calculations  show  that  rj  then  equals  0.95  and  0.90  for  TEq 
and  TM  modes,  respectively.  There  appears,  therefore,  to  be  an  excellent  agree- 
ment between  the  simple  Bragg- scattering  argument  for  determining  the  grating  profile 
and  the  actual  result.  In  addition,  we  also  note  that: 

( 1 ) Good  blazing  is  achieved  for  a wide  range  of  values , roughly  0 . 5 < tg/\  < 1.1. 
This  is  explained  by  the  fact  that  C3i  need  not  satisfy  the  Bragg  condition  ex- 
actly; its  vertex  must  only  be  sufficiently  close  to  the  large  circle  shown  in 
Fig.  3,  say,  within  the  region  denoted  by  the  smaller  circle  shown  dashed. 

(2)  In  the  above  range  of  values  for  tg/\,  both  TE0  and  TM0  modes  exhibit  strong 
blazing.  This  is  expected  because  the  propagation  factors  |3  for  these  modes 
are  nearly  equal,  so  that  the  Bragg  construction  in  Fig.  4 holds  for  both 
modes  . 


Figure  5 also  shows  curves  of  the  leakage  parameter  a\,  which  is  proportional 
to  the  beam  coupling  factor.  This  parameter  was  found  exactly  by  a numerical  method 
(shown  dashed)  and  by  a simpler  perturbation  method  ^ (shown  solid).  It  is  interesting 
to  note  that  leakage  is  larger  in  the  range  of  t /\  for  which  blazing  is  maximum. 

Dielectric  gratings  having  other  parameters  are  currently  under  investigation  in 
order  to  further  examine  the  generality  of  the  present  results. 
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EFFECTS  OF  PHASE  VARIATIONS  ON  OPTICAL  PARAMETRIC  MODE- COUPLING 
DYNAMICS 

E.S.  Cassedy  and  M.  Jain 

Parametric  mode- coupling  phenomena  in  laser-driven  plasmas  has  been  studied 

1 2. 

for:  (anomalous)  heating  mechanisms,  instability  saturation  mechanisms,  x-type 

3 4 

wave  breaking  criteria  and  interpretations  of  soliton  formation.  Similar  mode-cou- 

2 5 

pling  has  also  been  considered  for  ionospheric  applications,  ’ for  magnetically  con- 

t>  V 

fined  plasmas  and  in  the  study  of  ocean  wave  dynamics.  In  any  of  these  studies,  the 

possibility  of  multi-mode  par ametric ^ interactions  must  be  considered,  albeit  the 

fastest- growing  interaction  only  is  often  chosen  as  a simplifying  approximation. 

In  the  plasma  literature  the  time-dynamics  of  parametric  mode- coupling  have 
customarily  been  treated  as  an  initial- value  problem.  Such  a treatment  means  typical- 
ly that  the  (large-amplitude)  pump  mode  is  assumed  to  be  "turned  on"  (at  t = 0)  and 
that  the  other  (initially- small)  interacting  modes  grow  up  to  the  saturation  level  there- 
after. When  multi-mode  interactions  are  treated  in  this  manner,  the  various  modes 
have  been  assumed  to  belong  to  the  Fourier-mode  spectrum^  of  an  unbounded  plasma. 

Pump-driven,  bounded  plasmas,  on  the  other  hand,  have  been  considered  for 
three- wave  interactions  ’ of  the  parametric  type.  Here  the  time- space  evolution  of 
the  three  (coupled)  waves  is  assumed  to  be  governed  by  three,  coupled,  first-order 
wave  equations  (in  time  and  space)  with  appropriate  boundary  and  initial  values.  Here 
the  initial  condition  can  assume  zero-level  pump  fields  and  small  amplitudes  for  the 
other  two  waves  throughout  the  interior  bounded  region  of  interaction.  The  boundary 
values,  however,  include  a fixed  value  for  the  pump  field  at  one  boundary  to  represent 
the  pump  wave  incident  from  outside  of  the  region.  Thus  the  wave  solutions  in  these 
formulations  are,  in  effect,  driven  (or  sustained)  by  the  incident  pump  power. 

The  time  dynamics  of  the  parametric  interaction  of  modes  have  been  examined  in 
the  quantum  electronics  literature^  ^ for  application  to  optical  parametric  oscillators 
(OPOs).  The  modes  for  an  OPO  are  the  resonant  longitudinal  modes  of  a Fabry-Perot 
cavity.  First-order  modelling  equations  for  the  time  dynamics  of  OPOs  have  been 
given  by  Yariv  and  Louiselle/^  who  assume  only  a s ingle- longitudinal- mode , three- 
frequency  interaction.  These  equations  have  been  generalized^  for  multi-mode,  multi- 
frequency parametric  interactions.  Whereas  this  previous  work  had  "injection  tuning," 
of  OPOs  as  its  principal  objective,  the  modelling  equations  appeared  capable  of  dealing 
with  multi-mode  effects  more  generally,  within  the  assumptions  made. 


^The  term  "parametric"  is  used  here  in  the  generic  sense  and  includes  parametric- 
decay,  stimulated- scatter ing  and  purely- growing  interactions. 
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It  should  be  emphasized  that  these  modelling  equations  represent  the  coupled 

11  12 

modes  of  the  OPO  as  they  are  d riven  by  an  incident  pump  field.  The  equations  ’ 
are  a coupled  set  first-order  differential  equations  with  an  inhomogeneous  (pump,  driv- 
ing) term,  which  is  distinct  from  the  pump  mode  amplitude  (a  dependent  variable). 
Markedly  different  saturation  occurs  for  the  driven  case  as  versus  the  undriven,  as 
must  be  expected.  In  any  driven  multi-mode  case,  there  is  always  a final  steady  state 
(limited  by  mode  damping)  where  the  fastest  growing  mode  is  dominant  along  with  its 
idler  mode  and  the  pump  mode.  All  other  modes  (and  idlers)  decay  once  the  pump  mode 
amplitude  has  been  depleted  to  the  threshold  value.  In  the  undriven  case/  on  the  other 
hand,  saturation  seems  to  be  dominated  by  the  lightest-damped  modes  and  not  by  the 
highest- gain  modes. 

11  12 

The  modelling  equations  ’ are,  of  course,  deterministic  in  the  complex  am- 
plitudes. As  such,  they  would  appear  to  ignore  the  effects  of  random  phases13,  14  as 
versus  well-defined  phases,10  which  have  resulted  in  significantly  different  predictions 
on  stabilization  of  plasma  wave- wave  interactions.  It  should  be  noted  here,  however, 
that  these  previous  studies*  ' were  for  undriven  systems.  Our  purpose  here  is  to 
demonstrate  that  a driven  set  of  time-dynamic  equations,  representing  a bounded , para- 
metric-wave system,  has  a semi- well-defined  final  steady  state  in  the  phases  of  the 
interacting  modes.  The  phases  are  found  to  be  " semi- well- defined , " since  only  the 
pump-mode  phase  is  uniquely  determined,  for  the  usual  initial  conditions  (all  modes 
grow  from  noise  levels).  The  signal  and  idler  modes,  on  the  other  hand,  appear  to 
meet  a condition  concerning  the  sum  of  their  phases,  but  not  have  individual  unique  val- 
ues in  the  final  steady  state. 

In  addition,  the  results  to  follow  will  demonstrate  that  the  (mode)  amplitude  growth 
to  saturation,  for  the  entire  range  of  possible  initial  (mode)  phases,  is  bounded  by  a 
relatively  narrow  envelope  in  time.  Furthermore,  the  (mode)  amplitudes  settle  to  uni- 
que values  in  the  final  (saturated)  state,  regardless  of  (pump,  signal  or  idler)  amplitudes. 
The  results  would  suggest  that  the  assumption  of  initial  (mode)  phases  is  unimportant 
for  any  consideration  dealing  solely  with  amplitude  (or  intensity)  growth,  to  saturation, 
of  a driven  parametric  system. 

In  our  previous  study  of  OPO  dynamics only  absolute  values  of  the  signal,  idler 
and  pump  mode  amplitudes  were  plotted.  Since  the  mode  amplitudes  are  complex  quan- 
tities, the  phase  dependence  should  be  included  in  the  dynamic  behavior  of  the  OPO.  In 
order  to  study  the  effect  of  those  phase  variations,  consider  the  simplest  (single  mode, 


^ Large  initial  amplitudes  of  the  pump  mode  are  assumed. 
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N = 1)  set  of  normalized  (complex)  equations: 


dT  ' 'ap  ' ^a\a2  ala2  + Ip 


1 / * 

dT=-Vl  + Ma2ap 


2 * , * 

dT  = "a2a2  + yalaZ  alap 


i<J>  i<>,  »<t>? 

P _ A A 1 _ _ A _ ^ 


Separating  the  phases  by  writing,  a ^ e ^ , a ^ = A^  e and  a^  = A^e  .one  obtains 

a set  of  six  real  equations: 


dT  = ’Ap'  m A1A2  COs(4>1  + V V + Psin  ^p 


TF  = -alAl  + 2 A2Ap  COs(<<)1  + V V 


— = -Q2A2+  V^AlAPCos(<t>l  +<},2-  V 


Ap  dT  = * VV*2  A1A2  sin(4>l  + V V + P cos  4> 


IV2  A2Ap  sin(<f5l  + *2'  V 


A2-dT  = ■ l°Z  AlAp  Sin(<t,l  + V V 

The  final  steady  state  of  these  equations  is  obtained  by  setting. 


^(Ap.Aj.A^p,^)  =0 

Putting  this  condition  in  Eqs.  (4d)  to  (4f)  gives, 

sinl^j  + <)>2  - <bp)  = 0 

cos  <j>  =0 

P 

with  the -solution,  ^ = i tt/2  and  + <t>2  - — n/2,  T if/2.  To  find  the  steady  state 
(d/dT  = 0)  condition  for  Eqs.  (4a)  to  (4c)  we  then  examine: 

Ap  = * V®i^2  A1 A2  cos(<t>i  + * ^p^  + P sin  *Pp  (5i 

»1A1  = ->/®7^2  A2Ap  COS<0>1  + <t>2‘  ‘V  *51 
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aZAZ  = V"ia2  A1AP  cos(<1>1  + *2  ' V • <5c> 

Remembering  that  A , A , A > 0,  we  conclude  that  cos(4>.  + 4>_  - 4>  ) = 1 (the  - 1 
P l c.  1 c.  p 

value  being  nonphysical).  Also  sin  <j>  = 1 (the  -1  value  also  being  nonphysical).  There- 

fore there  is  a unique  solution  for  4>^  and  4>^  + 4^,  namely:  <(>  = it / 2 and  4^  + 4^  = tr/2. 

On  adding  Eqs.  (4e)  and  (4f)  it  is  seen  that  the  complete  set  of  six  equations  can 
be  represented  now  by  five  equations  which  depend  only  on  (4>j  + 4>^)  and  not  on  4^  and 
4>2  independently.  In  general,  in  the  final  steady  state,  the  values  of  4>  and  4>  will  de- 
pend on  the  initial  values  of  these  parameters  and  may  be  obtained  by  numerical  solu- 
tions (see  Figure  1).  These  values,  however,  will  always  be  such  that  4>  + 4>  = rr/2  in 
the  final  steady  state. 

On  putting  the  values  4>p  = tr/2  and  4>j  + 4>2  = ti r/2  in  Eqs.  (5a)  to  (5c)  and  solving 
simultaneously,  we  find: 


°Ta2AlA2 


a.  a A.  A A 
1 Z 1 Z p 


1 = Ai  —hr  Ai + p = -QiAf + p 


similarly 

Therefore  we  have  found  that  the  final  solution  is  unique  as  far  as  the  absolute  values 
are  concerned. 

A.  Intermediate  Dynamics 

Since  the  original  six  equations  (4a)  to  (4f)  are  cross  coupled,  the  intermediate 
values  of  Ap,  Aj  and  A 2 are  expected  to  depend  on  the  initial  phases  4>  , 4^  and  4> 
chosen.  However,  as  seen  before,  these  equations  reduce  to  an  equivalent  set  of  five 
equations  containing  (4>j  + 4^)  and  not  4^  and  4>^  separately.  Hence  the  intermediate 
dynamics  are  expected  also  to  depend  only  on  the  initial  values  4>  and  (4^  + 4^). 

Let  us  first  consider  the  variation  of  4>  under  the  assumption  that  A , A.  and  A 

P pi  2 

start  initially  from  very  small  values.  Near  time  T=0,  all  A , A,  and  A,  are  of  the 

pl  z 

same  order,  and  are  still  extremely  small,  and  hence  one  can  drop  the  Aj  A^ 

term  compared  to  the  A^  term.  Hence  at  short  times,  when  and  A^  are  very  small. 
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TABLE  I.  Initial  and  final  phases. 


Fig.  1 

initial  values 

final  values 

curve 

number 

^p  *1 

*2 

*P 

+ 

0 

o 

00 

0 

o 

0° 

0 

o 

O' 

154°  -64°  j 

90° 

© 

0 

o 

0 

o 

0° 

90° 

26°  64° 

90° 

© 

0°  135° 

0° 

| 90° 

121°  -31° 

90° 

o 

0 

LP 

0 

0° 

j 90° 

59°  31° 

90° 

© 

0 

o 

O' 

0 

o 

0° 

90° 

90°  0° 

90° 

0 

0 

1 

0 

0° 

i 90° 

-18°  ! 108° 

90° 

© 

1 

0°  -135° 

0° 

90°  | 

-162°  ! 252° 

0 

o 

O'- 

© 

0°  -90° 

0° 

0 

o 

O' 

-90°  i 180° 

1 

90° 

by  the  set  of  equations. 


- A + S sin  <J> 
P P 


COS  <t> 


with  the  steady  state  solution,  <j>  = tt  / 1,  A = p (see  Figure  1).  Thus  the  pump  reaches 

P P 

this  quasi  steady  state  shortly  (a  few  normalized  time  constants)  after  time  T = 0 and 
hence  phase  variation  of  4>p  is  not  expected  to  take  a significant  part  in  changing  these 
early-time  dynamics.  Later,  when  Aj  and  A^  become  significant,  one  might  expect 
4>p  to  vary  from  its  steady  state  value  (tt/2),  as  seen  from  Ecuation  (4d).  However, 
such  variation  would  be  small  since  the  two  terms  on  the  RHS  of  Eq.  (4d)  are  of  oppo- 
site signs  and  once  (4>.  + 4>9)  — • ir/2,  d4>  /dT  — ■ 0,  even  for  large  A.  and  A,.  Thus  var- 

l z p 12 

iation  in  initial  value  of  4>^  is  expected  to  play  little  role  in  changing  the  intermediate 
or  final  steady  state  dynamics. 

However,  the  intermediate  state  does  depend  on  the  initial  value  of  4>j  + 4>^  and 
numerical  methods  may  be  employed  in  order  to  obtain  the  actual  values  of  aj  and  a^,. 
Even  before  obtaining  numerical  results,  one  can  make  some  qualitative  deductions  on 
examining  the  Equations  (4b)  and  (4c).  Since  the  parametric  amplification  term  is 
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proportional  to  cos($j  + - <)>  ),  one  expects  that  the  closer  the  initial  ( <j> ^ + <p^)  is  to 

the  final  steady  state  value  (tr/2),  since  <)>  = (ir/2),  the  more  will  be  the  amplification. 
In  other  words,  the  initial  <f> ^ + <j>£  = it  /2  curve  for  Aj  will  lie  above  the  other  curves; 
with  the  initial  <j>j  + <t>^  = ir  /2  curve  being  at  the  bottom.  Such  behavior  is  in  fact  con- 
firmed by  the  numerical  solutions  shown  on  Fig.  1,  for  the  initial  phases  listed  in 
Table  I. 


Also,  on  changing  <|>j  + ~ it  - ( ^ ^e  equations  remain  unchanged.  Thus, 

any  initial  ( <f> ^ + <f>^)  and  ir  - (<j>j  + <\>^)  will  result  in  the  same  dynamical  behavior.  This 

is  also  shown  by  the  numerical  result.  Figure  1 shows  the  dynamical  behavior  for 

(4>1  + 4> 2)  initial  value  varying  from  ir  — -3ir/4.  It  is  found  (Table  I)  that  the  range 

w / 2 — -n/2  covers  the  entire  set  of  values  as  far  as  the  values  A , A,  and  A , are  con 

P 1 2 

cerned,  and  the  curves  lie  within  a narrow  bandwidth  of  each  other. 


It  is  to  be  noted  that  on  changing  i0- 


(3,  one  obtains  an  analogous  set  of  equations. 

1/2 


These  equations  have  similar  steady  state  solutions  with  a = 1,  A,  = (0-1  /a  V C and 
1/2  P 1 1 

A.  = (0-l/o  ) . The  only  differences  is  in  the  steady  state  values  of  the  phases. 


which  are:  <j>^  =0,  <t>^  + = 0,  respectively. 
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THEORY  OF  THE  INTEGRATING  SPHERE  FOR  PULSED  LIGHT  SOURCES 
K.  Park  and  W.  T.  Walter 

An  integrating  sphere  is  frequently  used  in  the  measurement  of  the  diffuse  reflec- 
tance of  a surface  because  of  its  ability  to  collect  light  reflected  into  all  angles  by  the 
target  surface.  A continuous  light  source  is  employed  in  these  measurements  so  that 
time  delays  caused  by  multiple  reflections  within  the  integrating  sphere  affect  only  the 
initial  detector  response  and  not  the  steady  state  values.  In  examining  the  reflectance 
change  of  a surface  during  a short  intense  pulse  of  light  such  as  a Q- switched  laser 
pulse,  however,  the  effect  of  multiple  reflections  must  be  taken  into  consideration. 


Not  only  does  an  integrating  sphere  carry  out  a spatial  integration  of  light  re- 
flected from  a target  surface,  but  it  delays  the  arrival  of  light  at  the  detector  by  vary- 
ing times  depending  on  the  path  within  the  sphere.  Thus  the  temporal  shape  of  the 
detected  light  signal  may  be  significantly  modified.  For  example,  we  have  found  ^ that 
a 30-ns  FWHM  (Full  Width  at  Half-Maximum  intensity)  Q-switched  ruby  laser  pulse 
reflected  from  a target  surface  placed  at  the  sample  port  of  a 20  cm  diam  integrating 
sphere  was  lengthened  by  - 10  ns  when  measured  by  a fast  vacuum  photodiode  located 
at  the  detector  por‘  (Figure  1). 


PULSED 

LASER 


BEAM  ENTRANCE 
PORTS 


Fig.  1.  Total  reflectance  measurement  of  a metal 
surface  using  an  integrating  sphere. 


At  first  thought  it  might  seem  that  if  the  same  process  were  applied  to  a sample 
of  the  incident  light  signal,  then  the  ratio  of  the  two  output  signals  from  the  integrating 
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sphere  would  still  yield  the  reflectance  of  the  target  surface.  It  will  be  pointed  out  in 
this  report  that  this  is  only  true  for  slowly  changing  reflectances. 

So  far  as  the  authors  are  aware,  no  theoretical  analysis  of  the  integrating  sphere 

2 

for  a time-dependent  light  source  is  available.  Jacquez  and  Kuppenheim  give  the  gen- 
eral theory  of  the  integrating  sphere  for  time- independent  light  sources  using  an  inte- 
gral equation  formulation.  They  solved  the  equation  using  Fredholm's  method.  In  this 
report  their  theory  is  extended  to  include  time- dependent  light  sources,  such  as  fast 
laser  pulses. 

The  theory  of  the  integrating  sphere  is  the  theory  of  multiple  reflections  in  a cav- 
ity. Consider  an  integrating  sphere  with  radius  R whose  interior  wall  is  a perfect  dif- 
fuse reflector  (i.e.,  all  points  on  the  surface  have  constant  radiance).  A light  pulse 
with  power  PQ(t)  [watts]  is  directed  into  the  cavity  through  an  entrance  window  (Fig.  1) 
onto  the  wall  where  it  begins  a path  of  multiple  reflections  before  collection  by  the 
detector.  We  shall  assume  that  the  total  area  of  the  various  ports  is  small  compared 
to  the  total  area  of  the  sphere  and  that  the  reflecting  surface  is  perfectly  spherical. 

Using  the  spherical  coordinates  (9,40,  we  sh^ll  represent  by  I( t,  0 , <p)  the  irradi- 

ance  at  (0,  40  and  at  time  t caused  by  the  reflected  flux  in  the  sphere.  Following  Jacquez 
2 

and  Kuppenheim  we  shall  treat  the  irradiance  striking  an  area  da  on  the  sphere  as  a 
sum  of  two  components,  IQ(t,  0,  40  representing  the  primary  incident  light  pulse  and 
I(t,0,4O  representing  the  light  reflected  by  the  remainder  of  the  sphere  to  da.  The 
quantity  iQ(t,  0,  <p)  will  be  zero  for  all  points  on  the  wall  of  the  sphere  except  for  the 
small  area  directly  irradiated  by  the  incident  light  beam.  Let  a constant  r denote  the 
total  reflectivity  at  any  point  on  the  sphere.  Then  r(IQ  + I)  is  reflected.  Since  all  points 
on  the  surface  are  assumed  to  reflect  perfectly  diffusely,  any  element  area  of  da  (Fig. 

2)  has  constant  radiance  N and 

N = [Io(t,0.4O  + I(t,  0,  40]  f . (1) 

Now  the  flux  radiated  from  one  area  da'  to  a second  area  da  is  given  by  N'dfl  da'  cos  a 
where  df2  is  the  solid  angle  subtended  by  the  area  element  da  and  cos  a is  the  Lambert- 
ian factor  (Figure  2).  From  Fig.  2 and  Eq.  (1)  we  can  show  that  the  radiated  flux  from 
da ' to  da  is 

_r  -i  (_P*n)  ( - p • n1) 

-[lo(t.e',V)  + I(t.e',4»’)J  da— 3-  da' -=-■=_  (2) 

P W 

On  the  sphere. 


(_£.n)  (-_p.  n') 
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Fig.  2.  Vector  relations  between  area  elements  on  the  sphere. 

Therefore,  the  irradiance  at  (t,8,4>)  is  given  by 

l(t,0,b)  = r-  ? f I (t-At,  6',  <t>')  da 1 + ■ f y f I(t  - At',  0',  4>')  da 1 (3) 

4nR2  S ° 4nR*  * 

where  At,  is  the  time  for  light  to  travel  from  (O',  4>')  to  (6,  <j>)  and  is  given  by 

...  p 2R cos  a 

At  = = 

c c 

where  c is  the  speed  of  light. 

The  first  integral  on  the  right-hand  side  of  Eq.  (3)  gives  r P (t)/ 4ir  R^  where  P^t) 
is  the  incident  light  power.  Here  an  approximation  has  been  made  that  I^ft  - At,  0',4>')~ 
I (t,  0',  4>')  which  is  valid  for  a laser  pulse  whose  FWHM  is  much  larger  than  R/c.  For 
a 20  cm  diam  integrating  sphere,  p/c  < 0 . 7 ns  . 

Thus,  we  have  a time-delayed  integral  equation: 

I(t,  0,  4>)  = r I (t)  + — L- ? £ I(t  - At  , 0 ',  <*>')  da  ' (4) 

S 4 n R^  S 


where 


I 

s 


(t) 


4ttRZ 


Since  Ig(t)  is  not  a function  of  position  on  the  surface  of  the  sphere,  the  only  de- 
pendence of  I(t,  0,  4>)  on  position  is  through  the  time  delay  At(0,  q>*  0',  4>')-  If  we  take  the 

observation  point  (0,  b)  at  the  north  pole  (any  point  is  equivalent  on  the  sphere  for  our 

2R  0' 

assumptions),  then  the  b'  dependence  disappears  since  At  = sin  — (see  Figure  3), 
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Fig.  3.  Relations  between  the  observation  point  and  the  source  point. 
Now  substituting  into  Eq.  (4)  we  find  that 

TT  of 

I(t)  = r Ig(t)  + / I(t  - sin  j ) sin  0'  d0' 

o 

where  a new  time  unit  of  2R/c  has  been  employed. 

Changing  the  variable  in  Eq.  (5)  by  putting  sin  ~ = x,  then  we  obtain 

A1  6’  / 2 

sin  G1  = 2 sin  — cos  - 2x  y 1 - x 


dG’  = 


2 dx 

IT 7 


(5) 


and  Eq.  (5)  becomes 

1 

I(t)  = r I (t)  + 2 r f I(t  - x)x  dx  . (*>) 

o 

At  this  point  we  shall  consider  absorption  and  aperture  losses  of  the  integrating 

sphere.  Let  the  total  area  of  the  sphere  be  SE  4nR2,  the  spherical  area  removed  for 

the  detector  port  be  b,  that  for  the  entrance  ports  a,  and  that  for  the  sample  port  c. 

Then,  if  d = S-  (a  + b + c),  we  may  define  an  equivalent  average  sphere  reflectance 

l - rd/S  + r c/S  where  r is  the  average  reflectance  of  the  sample  attached  to  the  sam- 
' s s 

pie  port.  The  quantity  (1  - l ) accounts  for  total  losses  due  to  the  absorption  by  the 
integrating  sphere  wall  and  target  sample  as  well  as  the  loss  due  to  escape  of  light  flux 
through  the  various  holes.  Thus,  Eq.  (6)  can  be  written 
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I(t)  = rl  (t)  + 2i  I I(t-x)xdx 


(?) 


Equation  (7)  could  be  solved  by  a direct  substitution  method.  However,  since  the 
integration  interval  is  very  short  compared  to  the  pulse  width,  instead  of  directly  sub- 
stituting into  the  integral  of  Eq.  (7),  we  may  analytically  carry  out  the  integration  in  the 
following  way.  A linear  approximation  to  I(t-  x)  is  used  over  the  range  from  x = 0 to 
x = 1 . Then 

I(t-  x)  = [I(t-  1)  - I(t)]  x +I(t)  . 

Now,  substituting  the  approximate  value  of  I(t  - x)  into  Eq.  (7)  and  carrying  out  the  inte- 
gration, we  obtain 


(1  - )I(t)  = rls(t)  + j £ I(t-  1)  . 

Finally,  we  may  express  I(t)  as 
I(t)  = c 1Is(t)  + c2I(t-  1) 

where 


3r 

C1  “ 3-  i ’ 


2 i 


(8) 


(9) 


and 


2 " 3-  f ' 


PQ(t) 

- -ir 


, r C 

t = + _®_ 
' s s 


To  obtain  I(t)  from  Eq.  (9)  we  follow  an  iterative  procedure  as  follows: 
I<0)(t)  = Cjl8(t) 

I(1)(t)  = ClIg(t)  + c2I(0)(t-  1) 
s clIs(t) 

I(2)(t)  = Cjls(t)  + c2I(1)(t-  1) 
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clIs(t)  +clc2Is(t*  l)  + clc2Is(t'  2) 


I(n,(t)  = ClIs(t)  + c2I(n'  1J(t-  1) 


= Cl[ls(t)  + c2Is(t-  1)  +c2Is(t-2)  + ...  +cjlg(t-n)] 


(10) 


21 


This  series  converges  for  c2  = ^ _-j  < 1,  i.e.,  for  l < 1 . Therefore,  whenever  there 
is  a loss  in  the  integrating  sphere  (which  is  always  the  case)  the  series  converges. 
From  the  last  expression  of  Eq.  (10)  we  can  clearly  see  how  the  incident  pulse  Ig(t) 
transforms  itself  into  a longer  pulse  for  each  reflection  inside  the  sphere. 

Conversely,  we  shall  now  address  the  more  important  question  of  whether  it  is 
possible  to  recover  the  original  pulse  from  the  detected  pulse  which  emerges  from  the 
integrating  sphere.  Equation  (9)  can  be  used  to  find  the  answer  without  having  to  trace 
back  through  the  iterations  indicated  in  Equation  (10).  From  Eq.  (9)  then, 


I_ (t)  = ~ [I(t)  - c_I(t-  1)]  , 


1 

and  therefore  the  inverse  problem  is  actually  the  easier  one.  For  pulses  whose 
2.  R 

FWHM  - -7-  or  less,  however,  Eq.  (7)  must  be  used  to  find  Ig(t).  Then, 

1 


(11) 


I (t)  =7  [I(t)  - 2f  / I(t-  x)xdx] 


(12) 


The  output  signal  V(t)  of  a suificiently-fast  photodetector  is  given  as  V(t)  = Dbl(t) 
where  D is  the  detector  responsivity  in  volts/watt  and  b represents  the  area  of  the 
detector.  Using  Ig(t)  = P^(t)/(4irR  ) we  obtain  from  Eq.  (11): 


Po(t) 


V(t-  ^)] 


(13) 


where  P (t)  Is  the  original  light  flux  entering  the  integrating  sphere  and  we  have  returned 
to  the  conventional  time  unit  of  seconds. 

4 

For  a 20cm  diam  integrating  sphere  coated  with  a barium  sulfate  coating  having 
99%  reflectance  at  700  nm  and  with  holes  for  the  detector,  target  sample  and  beam  en- 
trance port  amounting  to  3%  of  the  total  surface  area  of  the  sphere 


t - .96,  Cj  - 1.46,  c2 


.948  . 
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Now  from  Eq.  (13) 

VQ(t)  = 57[V(t)  - . 948  V(t- 0.67)]  (14) 

where  VQ(t)  is  the  voltage  that  would  appear  on  the  photodetector  output  if  there  were 
no  delay  due  to  multiple  reflections  within  the  integrating  sphere.  In  Eq.  (14),  t is 
measured  in  nanoseconds.  Equation  (14)  can  be  used  to  recover  the  voltage  V (t)  of  a 
light  pulse  entering  the  integrating  sphere  from  the  voltage  V (t)  displayed  on  the  oscil- 
loscope from  the  detector. 

Summation  of  multiple  reflections  resulting  in  a final  output  pulse  from  the  inte- 
grating sphere,  according  to  the  theory  just  developed,  is  shown  in  Fig.  4,  where  the 

16.8 

12.0 
!<T> 

8.03 

4.00 

0.0 


Fig,  4.  Development  of  an  output  pulse  from  the  integrating  sphere, 

development  of  an  output  pulse  is  shown  by  successive  pulse  curves.  The  smallest 
pulse  curve  is  a portion  of  the  original  pulse  intensity,  that  is,  Is{t),  the  component 
directly  reflected  into  the  detector.  Each  successive  curve  shown  represents  five  iter- 
ations (in  Equation  (10)).  A total  of  40  iterations  carried  out  on  a PDP  11/40  mini- 
computer are  displayed  in  Figure  4.  The  loss  (1  - f ) was  taken  as  5%.  After  - 100 
iterations  the  pulse  shape  has  reached  a steady  state. 

In  Fig.  5(a)  and  (b),  the  development  of  the  lengthened  pulses  exiting  from  the 

sphere  and  the  recovery  of  the  original  pulses  entering  the  sphere  are  illustrated  for 

pulse  curves  which  have  been  chosen  to  model  the  experimental  results  of  Bonch- 
6 7 

Bruevich  and  Zavecz.  They  found  that  the  reflectance  of  metal  surfaces  decreased 
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dramatically  during  an  intense  laser  pulse.  To  model  the  results  of  Bonch-Bruevich 
and  Zavecz  we  assume  a reflectance  curve  consisting  of  an  exponential  decrease  to  a 
plateau  reflectance  of  0 . 5 followed  by  a second  exponential  decrease.  A Gaussian  in- 
put laser  pulse  is  assumed.  Then  the  reflected  pulse  modeled  for  this  situation  is  the 
product  of  the  Gaussian  laser  pulse  and  the  assumed  reflectance  curve. 

In  Fig.  5(a)  the  reflected  pulse  (dotted  lower  pulse  curve)  and  the  sample  incident 
laser  pulse  (dotted  upper  pulse  curve  normalized  to  a peak  intensity  of  1)  are  shown  to- 
gether with  their  ratio  (dotted  curve  beginning  with  a reflectance  of  1,  going  through  a 
plateau  at  R - 50%  and  decreasing  to  -20%  at  the  end  of  the  pulse).  Applying  Eq.  (10), 
we  find  that  the  reflected  and  sample  pulse  curves  develop  into  the  solid  curves  shown 
in  Fig.  5(a)  when  they  emerge  from  the  sphere.  Note  the  change  in  the  reflectance 
curve.  The  spatial  integration  carried  out  by  the  integrating  sphere  also  smooths  the 
sharp  temporal  reflectance  changes  in  addition  to  lengthening  the  pulse. 

Figure  5(b)  shows  the  inverse  process.  In  this  case  the  output  reflected  and  sam- 
ple pulses  from  the  integrating  sphere  are  dotted  indicating  they  are  the  initial  conditions. 
Recovery  of  the  input  reflected  and  sample  pulses  entering  the  integrating  sphere  is 
obtained  using  Eq.  (11)  and  is  indicated  by  the  solid  curves  in  Figure  5(b).  No  iteration 
is  necessary  in  this  case.  The  overlap  of  the  solid  curves  in  Fig.  5(b),  with  the  dotted 
curves  of  Fig.  5(a),  confirms  the  procedure  and  its  inverse  as  developed  in  this  report. 

Notice  that  although  the  integrating  sphere  is  a linear  device,  which  is  apparent 
from  Eqs.  (7)  and  (10),  the  ratio  between  two  different  time- dependent  light  sources  is 
not  preserved  through  the  integrating  sphere  measurement  process.  Therefore  to  ob- 
tain reflectance  histories,  for  example,  for  a metal  surface  under  intense  irradiation 
by  a laser  pulse,  the  reflected  and  incident  pulse  shapes  must  be  recovered  according 
to  Equations  (11)  or  (12).  Pulse-lengthening  is  most  pronounced  for  short  pulses.  A 
delta -function  input  6(t)  would  acquire  a pulse  width  proportional  to  the  integrating 
sphere  cavity  time  constant.  The  integrating  sphere,  therefore,  carries  out  a time 
integration  as  well  as  a spatial  integration  for  time- dependent  light  sources.  The  time- 
integration  is  hidden  in  the  spatial  integration  because  the  delay  time  between  various 
reflections  is  really  a function  of  position  on  the  sphere. 
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TWO-PHOTON  CORRELATIONS  IN  AMPLIFIED  LASER  LIGHT 
G*  D.  Blake  and  D.  B*  Scarl 

We  have  measured  the  two-photon  correlation  function  in  632-  8 nm  single  mode 
He-Ne  laser  light  which  had  been  amplified  by  stimulated  emission  in  a long  He-Ne 
discharge  tube*  For  laser  light  intensities  which  were  comparable  with  the  discharge 
tube  spontaneous  emission  intensity  into  a single  mode,  the  measured  correlations 
agree  well  with  those  predicted  for  a simple  mixture  of  laser  and  chaotic  light.  The 
results  are  consistent  with  the  assumption  that  laser  light,  linearly  amplified  by 
stimulated  emission,  has  the  same  statistical  properties  as  unamplified  laser  light. 

A«  Introduction 

Although  the  statistical  properties  of  simple  electromagnetic  fields  are  well 
understood,  the  interpretation  of  field  measurements  in  terms  of  elementary  physical 
concepts  is  not  always  straightforward*  One  of  several  sources  of  this  difficulty  is 
that  the  mode  operators  into  which  the  field  operators  most  easily  separate  have 
eigenfunctions  whose  squares  are  independent  of  position  and  time*  Using  these  mode 
operators,  the  usual  interpretation  of  the  field  in  terms  of  photons  yields  photons  that 
are  eigenstates  of  the  Hamiltonian  and  therefore  independent  of  time* 

Photons  that  are  excitations  of  a single  field  oscillator  at  a single  frequency 
necessarily  extend  over  all  time,  a requirement  that  is  difficult  to  reconcile  with  the 
creation  or  destruction  of  these  photons*  On  the  other  hand,  the  photons  that  partici- 
pate in  experiments  seem  to  be  created  in  the  neighborhood  of  one  space  time  point, 
to  propagate  occupying  a finite  region  of  space -time,  and  to  be  detected  in  the 
neighborhood  of  another  space-time  point*  It  is  these  photons  which  carry  information, 
excite  atoms,  show  interference  effects,  and  display  photon  statistics* 

The  properties  of  photons  are  intimately  connected  with  their  interaction  with 
atoms  and  other  massive  objects*  The  fundamental  processes  of  absorption, 
spontaneous  emission,  and  stimulated  emission  seem  to  exist  in  all  sources,  con- 
verters, filters,  and  detectors*  The  dynamics  of  these  processes  and  their  connection 
with  the  properties  of  the  absorbed  or  emitted  fields  has  been  one  of  the  most  beauti- 
ful applications  of  quantum  mechanics  in  the  past  two  decades*  However,  the  time 
dependence  of  these  transitions  is  usually  described  without  the  use  of  temporally 
localized  photons* 

Stimulated  emission  illustrates  one  of  the  apparent  difficulties  of  current  photon 
descriptions  of  the  electromagnetic  field*  When  an  identifiable  incident  photon 
stimulates  an  emitted  photon,  the  incident  and  emitted  photons  might  be  expected  to  be 
related  in  time*  Previous  attempts  to  detect  these  time  correlations  in  amplified 
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chaotic  light  have  shown  no  correlations  in  excess  of  the  HBT  effect-  * Since  the  HBT 
effect  shows  the  same  ratio  of  correlated  photon  pairs  to  uncoi  related  photon  pairs, 
independent  of  the  geometry  of  the  source  or  the  light  intensity  which  the  atoms  see, 
it  is  difficult  to  attribute  the  HBT  effect  itself  entirely  to  stimulated  emission.  It  can 
always  be  attributed  to  random  correlations  between  photon  pairs,  2’  ^ although  this 
point  is  still  under  discussion-  ^ 

In  particular,  single  mode  laser  light,  which  can  be  described  by  a coherent 
state  of  the  electromagnetic  field,  shows  no  HBT  effect-  Linearly  amplified  laser 
light,  which  can  still  be  described  by  a coherent  state,  although  having  been  amplified 
by  stimulated  emission,  would  therefore  be  expected  to  show  no  HBT  effect  and  no 
time  correlations  between  photon  pairs- 

We  have  passed  a beam  from  a single  mode  laser  through  an  amplifier  tube  in 
which  stimulated  emission  increases  the  intensity  of  the  laser  beam  by  30%,  and  have 
looked  for  time  correlations  between  pairs  of  successive  detected  photons  in  the  out- 
put light-  The  amplified  laser  beam  was  of  approximately  the  same  intensity  as  the 
spontaneously  emitted  light  from  the  gain  tube  in  the  same  spaceangle  mode-  Under 
these  conditions  every  stimulated  photon  generated  by  the  original  laser  beam  is  ac- 
counted for  in  the  amplified  laser  light;  there  is  no  other  stimulated  emission  associ- 
ated with  the  laser  light.  The  observed  photon  correlations  can  be  attributed  to 
second  order  interference  between  pairs,  one  of  which  is  from  the  amplified  laser 
light,  and  one  of  which  is  from  amplified  spontaneous  emission,  or  both  of  which  are 
from  amplified  spontaneous  emission-  No  correlations  between  incident  and  induced 
photons  seem  to  be  necessary  in  order  to  explain  the  results- 

B-  Counting  Rate  for  Point  Detectors 

For  a beam  with  stationary  statistics,  the  two-photon  counting  rate  for  two  in- 
finitesimal detectors,  one  of  which  samples  the  field  at  the  space-time  point 
Xl  = ancl  the  ot^er  of  which  samples  the  field  at  the  point  x ,,  is  pr  oportionaf  0 

to  the  second  order  correlation  function: 

G(2,(xt,  x2)  = < A+(x1)A+(x2)A(x,)A(x2)  > 

= Trp  A (x1)A+(x2)A(x1)A(x2)  (1) 

where  A(x  ) is  the  field  operator  at  the  point  x and  p is  the  field  density  operator- 
The  field  operators  can  be  expanded  in  a set  of  discrete  orthogonal  modes,  q(k.,x) 
with  the  mode  amplitude  operators  aj  and  propagation  constants  k.  = (k.,u.)  • 


176 


QUANTUM  ELECTRONICS 


A(x  ) 


£ a.q(k.,  x) 

i 


(2) 


The  field  density  operator  in  the  present  experiment  in  which  the  amplified  laser 
light  has  approximately  the  same  intensity  as  spontaneous  emission  from  the  amplifier 
tube,  describes  a mixture  of  amplified  laser  and  chaotic  (gain  tube  spontaneous 
emission)  light*  The  amplified  laser  light  is  assumed  to  have  the  same  density 
operator  as  ordinary  unamplified  single  mode  laser  light-  The  chaotic  light  has  the 
usual  maximum-entropy  density  matrix-  The  density  matrix  for  the  mixture,  written 
in  a basis  made  up  of  the  coherent  states  for  each  mode,  | a ^ > is*  * 


P = " JPt(a f)  Uj  ><  at\  d 


(3) 


It  is  convenient  to  let  the  mode  l = 1 be  that  which  contains  the  laser  light-  For 
this  mode 

- |a1-ao|2/<n1> 

e (4) 


P(ot) 


1 


" < n,  > 


where  a 


is  the  average  number  of  photons  in  the  amplified  laser  beam  and  < n^> 
is  the  number  of  chaotic  photons  in  the  amplified  spontaneous  emission  mode  with  the 
same  central  frequency  and  propagation  vector  as  the  laser  light- 

For  all  other  modes  there  is  spontaneous  emission  alone  and  the  weight  function 


P|(<V  = 


-U|  I /<n( 


it  < n j > 


(5) 


where  <n^>  is  the  average  number  of  spontaneously  emitted  photons  with  frequency 
Uj  and  propagation  vector  • 

The  second  order  correlation  function  can  be  written 


C(2)(xt,  x2)=Tr*JPf(ai)  | ai\rakamq^k.,rt)q«(k  r2)q(l<k,x1)q(kMx2»i:oi 

( i.J.k.m  J J 


+ + 


= f il  Km^iajak°mPf(af)^h',S)(l,i!(^x2)ci(kkxl)c’(km’x2) 


(6) 


Equation  (6)  is  also  the  classical  expression  for  the  second  order  correlation  function 
of  a field  with  mode  amplitudes  a whose  distribution  functions  are  P (a)  • 
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Using  the  fact  that  integrals  containing  unpaired  a's  are  zero,  the  sums  and 
products  of  Eq-  (6)  reduce  to 


G(2)(x1(x2)=  V /|a.|4  P (a  , d2o.  + 

l 

l^/la.^P.^.JdVl2  + 


!2/lail2Pi(ai)d2ai  <i*(ki'xi)  q(ki.x2>l 


(7) 


After  considerable  rearrangement,  reconstruction  of  sums,  and  replacement  of  sums 
by  integ  rals,  * 2 


G(2)(x(  x2)=(Il+Ic)2+I2|g(1)(x1,x2)|2  + 2ILIcReG(t)(x1,x2) 


where 


‘c*  j 


L-i*.' 


...  . ik(x  -x  ) 

G (x2»xi)=  k /<  nk  > 6 dk  ' 


(8) 

(9) 

(10) 

(ID 


Ij^  and  1^,  are  the  laser  and  chaotic  light  intensities-  G^(Xj,x2>  is  the  first  order 
correlation  function  for  the  field-  For  a stationary,  cross -spectrally  pure  field,  the 
correlation  functions  depend  only  on  coordinate  differences  and  the  first  order  cor- 
relation function  factors  into  separate  functions  of  time  and  of  space: 


G(2)(Ar,  At)  = IL+IC)2  [ 1+  C 


Y |G(1)(Ar)|2|G(1)(At)|2  + 


2 — — rr  Re  G(1)(Ar)G(1)(At)  ] - (12) 

<IL+IC> 

The  two-photon  counting  rate  can  be  interpreted  as  a background  term,  independent 
of  the  detector  spacing,  a term  arising  from  correlations  between  pairs  of  chaotic 
photons,  and  a term  arising  from  correlations  between  chaotic  photons  and  photons  in 
the  amplified  laser  beam- 
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C.  Expected  Counting  Rate  for  Real  Detectors 

The  expected  counting  rate  for  detectors  which  have  a finite  spatial  extent  and  a 
finite  time  resolution  can  be  gotten  by  integrating  over  all  pairs  of  points 
in  the  detector  aperture  and  convolving  with  the  detector  two-photon  time  resolution 
function. 

After  integrating  over  the  second  aperture,  convolving  with  the  photomultiplier 
time  response  and  taking  account  of  noise,  filter  leakage,  and  detector  efficiency,  the 
expected  counting  rate  is 


-At2/2^ 

R = R f 1 + C.  e 
o 1 1 


+ C2  e 


- At  2/2<t2 


where 


R =1“  e.e,T 
o 12 


1 = rL  + XC  + XN  + JW 


cz-sz7-z  -7#  <17' 

Here  and  are  the  detection  efficiencies  of  the  two  detector  chains,  T is  the 

width  of  one  PHA  time  bin,  1^,  1^,  and  1^  are  the  number  of  photons  per  second 
originating  from  the  laser,  from  chaotic  632-  8 nm  light  from  the  amplifier,  and  from 
chaotic  unwanted  633.  4 nm  light  from  the  amplifier.  is  the  mean  number  of 

photomultiplier  noise  counts  divided  by  The  correlation  terms,  Cj  and  are 

the  correlated  events  (as  a fraction  of  Rq)  that  arise  from  pairs  of  chaotic  632*  8 nm 
photons  and  from  632.  8 nm  chaotic  and  632-  8 nm  laser  photons- 

The  spatial  coherence  factors  and  arise  from  integrals  of  G^(Ar)  and 
|c^'^(Ar)|2  over  the  detector  aperture.  They  are  evaluated  and  plotted  in  References 
13-15*  <r  is  the  standard  deviation  of  the  Gaussian  first  order  time  correlation 
function  for  the  chaotic  light,  and 


(a/-/2)  + tr‘ 


(18) 
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7z  1231 

as  a result  of  convolution  of  G^(At)  and  |G^(At)|^  with  the  photomultiplier  time 
response  function,  whose  width  is  <r  . 

D-  Results 


Figure  1 shows  the  results  of  four  experimental  runs  with  amplified  laser  to 
chaotic  intensity  ratios  of  0,  *5,  1 and  2*  No  adjustments  of  any  kind  have  been  made 
to  the  data*  The  solid  line  in  each  case  is  a plot  of  Eq.  (17)  using  the  parameters  of 
Table  I which,  except  for  R , were  all  determined  independently  of  the  experimental 
runs*  In  all  cases  the  agreement  between  predicted  and  measured  values  is  within 
1%,  indicating  that  the  time  correlation  to  be  expected  from  amplified  laser  light  is 
the  same  as  that  predicted  for  unamplified  laser  light  of  the  same  intensity  as  the 
amplified  light,  mixed  with  chaotic  light  whose  intensity  is  that  of  the  unavoidable 
spontaneous  emission  from  the  amplifier- 
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TABLE  I.  Measured  experimental  parameters. 


Eatio 

*0 

cpe 

Xo 

cpe 

XL 

cpe 

** 

cps 

I 

V 
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S 

0:1 

35,1*35 

60321 

0 

IO36 

912 

0 

i:1 

32,523 

622U6 

281,91 

1052 

91*1* 

1.33 

1:1 

35,1*00 

61277 

60152 

1309 

930 

1.28 

2:1 

50,1*85 

60116 

120233 

1150 

906 

1.29 

Fig.  1.  Experimental  counting  rates  and  predicted  curves  for  four  runs  in  which  the 
ratios  of  amplified  laser  light,  Ij^.  to  amplifier  tube  chaotic  light  in  the  same 
mode,  Iq,  were  0,  .5,  1 and  2.  The  experimental  statistical  errors  and  the 
uncertainty  in  the  predicted  curve  arc  each  1%. 
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METALLIC  FIELD  EFFECT  AT  FREE  METAL  SURFACES 
H.J.  Juretschke,  E.  Segredo  and  M.  Eschwei 

In  the  past  years  we  have  carried  out  extensive  work  on  the  metallic  field  effect 
(MFE)  at  metal  insulator  interfaces.  As  shown  in  the  preceding  report,  and  some 
earlier  ones/  the  data  can  be  successfully  explained  in  terms  of  char ge- induced  stresses 
and  strains  at  the  interface  caused  partly  by  the  Maxwell  stresses  accompanying  elec- 
tric fields  (and  proportional  to  q^,  where  q is  the  surface  charge  density  at  the  inter- 
face), and  partly  by  surface  piezoelectric  effects  (proportional  to  q)  allowed  by  the 
lower  symmetry  of  the  structure  of  the  immediate  interface  region. 

It  is  of  great  interest  to  extend  such  work  to  free  metal  surfaces  because  the 
charge- induced  strains  are  then  only  functions  of  the  metal  surface,  and  not  subject  to 
any  mechanical  constraints  of  the  abutting  dielectric.  Under  such  conditions,  the  sys- 
tem under  study  is  reduced  to  an  electric  field  terminating  on  a metal  surface.  The 
elastic  response  to  such  a field  by  the  metal,  especially  at  very  low  temperatures,  has 
recently  become  a subject  of  investigation  in  connection  with  the  shielding  of  fields  of 
freely  falling  electrons  in  metallic  containers  Furthermore,  the  influence  of  the  ad- 
sorbed surface  layers  on  this  shielding,  expected  to  be  considerable,  has  never  been 
studied . 

As  a first  step  towards  such  study,  we  have  developed  parallel- plate  vacuum  ca- 
pacitor structures  of  very  small  interplate  spacing  such  as  to  allow  fields  of  the  order 
of  105  to  10^  volts/cm  in  the  gap,  and  mechanically  supported  to  prevent  collapse  of 
the  capacitor  under  such  strong  electrostatic  forces. 

Because  of  their  reduced  capacitance  relative  to  metal- dielectric- metal  sand- 
wiches, these  systems  are  expected  to  give  an  appreciably  lower  metallic  field  effect 
signal,  at  the  maximum  field  allowed  by  breakdown,  and  therefore  the  capacitor  plates 
were  redesigned  to  maximize  these  signals,  subject  to  various  practical  constraints  on 
size  and  sensitivity  of  our  measuring  system.  The  MFE  signal  is  given  by  an  incre- 
mental voltage  6V  of  the  form 

6V  = -In  Rq  6 

where  I is  the  current  flowing  through  the  capacitor  plate,  n is  the  number  of  

of  the  plate  between  the  measurement  electrodes,  RQ  is  the  area  resistance  it 
(inversely  proportional  to  the  plate  thickness)  and  62?Q  is  the  specific  change 
surface  conductance  with  charging,  our  unknown.  With  all  three  parameter  - 
Rg  at  our  disposal  we  have  been  able  to  enhance  our  sensitivity  suffit  iont!\  •.  • . 
to  see  the  MFE  under  free  surface  conditions.  We  report  here  our  hr'!  e 
effect  of  charge  on  silver  electrodes  exposed  to  the  atmosphere 
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The  depth  profile  of  our  sample  arrangement  is  illustrated  schematically  in  Fig- 
ure 1.  A silver  sample,  grown  epitaxially  on  thin  mica,  is  supported  on  optically  flat 


GLASS  SUBSTRATE  3.17mm 


MICA  SUBSTRATE  13mm 


OPTICALLY  FLAT  GLASS 


1 


I 


Fig.  1.  Cross  sectional  view  of  vacuum- spaced 
field  effect  capacitor. 

glass,  and  a counter  electrode  of  silver,  on  another  glass  plate,  is  kept  at  the  desired 
separation  by  a set  of  spacers  of  alumina  sputtered  onto  the  second  glass  plate.  In 
top  view,  the  structure  has  the  form  shown  in  Figure  2.  The  thin  electrode  ribbon  is 


Counte  r electrode 
Silver  1700A 

Fig.  2. 


Spacer  film  Silver  Sample 

A1,0,  Sfj  Silver  500A 

L 3 plus  "Leveling"  Film 

(Under  ALO.) 

Thin  film  configurations  of  vacuum- spaced 
field  effect  capacitor  package. 


in  the  form  of  connected  half  circles  in  order  to  be  able  to  surround  each  portion  of  the 
sample  by  supporting  spacers,  as  well  as  to  increase  the  total  length  of  the  sample 
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between  the  measuring  electrodes.  Elaborate  masks  have  been  necessary  for  the  Al^O^ 
to  permit  all  contact  electrodes  to  leave  the  sample  without  introducing  additional  points 
of  mechanical  contact  or  electrical  bypass  between  the  top  and  bottom  plates,  and  the 
whole  structure  must  be  assembled  in  careful  register. 

The  complete  structure,  under  operating  conditions,  gives  a parallel  plate  vacu- 

- 6 2 

um  capacitor  with  an  area  capacitance  of  about  2 • 10  F/m  , or  a plate  separation  of 

• 6 7 

about  4 • 10  m.  At  this  spacing  we  have  been  able  to  apply  fields  of  more  than  10 

volts/m  without  breakdown  or  mechanical  disturbance  of  the  geometry,  and  with  a 
sensitivity  enhanced  by  nearly  a factor  of  100  using  the  parameters  described  above, 
we  have  been  able  to  detect  satisfactory  and  reproducible  MFE  signals  on  silver  sam- 
ples . 

Our  data  give,  for  the  specific  surface  conductance  of  the  Ag/air  interface,  the 

value 

6Zq  /6q  = -3.0  • 10' 3 D'1  (C/m2)'1 

at  room  temperature.  This  number  is  smaller  than  that  for  Ag/mica  by  a factor  of 
about  five,  indicating  that  the  strain  induced  by  a given  surface  charge  density  at  the 
free  silver  surface  is  much  smaller.  The  number  is  comparable  to  what  we  have  ob- 
served for  Ag/mylar,  and  Ag/kapton,  suggesting  that  the  bonding  of  Ag  to  these  sur- 
faces is  weak.  In  terms  of  a simple  model  of  surface  piezoelectricity,  this  implies 
that  a free  silver  surface  is  much  less  distortable  by  charging  than  when  adhering  to 
mica.  This  is  probably  a consequence  of  the  equilibrium  structure  of  the  silver  surface 
region  in  the  different  environments. 

It  must  be  kept  in  mind,  of  course,  that  in  this  experiment  the  silver  surface, 
exposed  to  the  atmosphere,  is  not  clean,  and  that  we  are  really  seeing  the  effect  of 
charging  on  a gas  covered  surface. 

However,  this  preliminary  experiment  shows  that  the  MFE  is  measurable  on  such 
accessible  surfaces,  and  it  opens  the  way  to  a systematic  study  of  the  charge- induced 
strain  on  metal  surfaces  that  are  clean  or  covered  intentionally  by  various  layers  of 
adsorbed  or  deposited  material. 
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CHARGE-INDUCED  SURFACE  STRESSES  AT  METAL-INSULATOR  INTERFACES 
H.J.  Juretschke,  R.  Boucarut  and  E.  Segredo 

The  change  in  surface  conductance  of  metals  with  electrostatic  charging,  the  so- 
called  metallic  fie'd  effect,  has  been  observed  to  contain  two  contributions,  proportion- 
al to  q,  and  q , respectively,  where  q is  the  surface  charge  density.^  If  we  write 

62  = -Bgq  - Bq2  (1) 

then  the  coefficient  B has  found  a full  quantitative  explanation  as  measuring  the  elasto- 
resistance  of  the  sample  caused  by  strains  induced  in  the  sample  by  the  Maxwell  stresses 
in  the  dielectric.  We  can  therefore  write 

B = r sy  (2) 

where  the  conductance  £ appears  because  the  strains  in  the  sample  permeate  it  com- 
pletely and  make  the  q2-term  a bulk  effect.  The  constant  s measures  the  coupling  bet- 
ween the  strains  and  q,  and  y is  the  effective  elastoresistivity  coefficient  of  the  material 
for  the  given  strain  configuration.  Experimental  data  on  the  various  known  constants 
and  the  measurement  of  B give  satisfactory  agreement  between  both  sides  of  Equation 
(2). 2 

Largely  because  of  the  similarity,  as  well  as  characteristic  differences,  observ- 
ed in  the  behavior  of  B and  Bg  in  ferromagnetic  metals  under  the  variation  of  the  mag- 
netic parameters,  we  have  concluded  that  the  coefficient  B must  have  a similar  inter- 

s 

pretation  as  B.  However,  since  it  is  independent  of  the  thickness  of  the  metal  sample, 

Bg  must  be  related  strictly  to  a change  of  surface  conductance  caused  by  surface 
stresses  linear  in  q.  This  change  presumably  originates  in  charge-induced  changes  of 
the  surface  scattering  of  current  carriers,  but  at  least  phenomenologically  it  can  also 
be  described  in  terms  of  the  change  of  conductance  of  a thin  surface  layer  as  a function 
of  surface  strain. 

If  a surface  stress  X causes  strains  throughout  an  effective  thickness  t , then 
8 8 

we  can  formulate  this  model  in  analogy  with  Eq.  (2),  and  obtain 

Bsq  = <*t8)(SX8/t8)ys  (3) 

where  a is  the  conductivity  of  the  metal,  S is  the  metal's  elastic  constant  for  isotropic 
stress,  and  yg  is  the  surface  counterpart  of  y of  Equation  (2).  Combining  Eqs.  (2)  and 

(3),  we  can  use  the  measured  values  of  B and  B to  determine  the  surface  stress  X : 

s s 

Xg/q  = t(s/S)(Bg/B)(y  /yg)  (4) 

where  t is  the  sample  thickness,  introduced  to  eliminate  £ = et.  If  we  further  assume 
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that  the  strain  dependence  of  scattering  is  the  same  in  bulk  and  in  the  surface  layer, 
i.e.,  yg  = y,  then  all  quantities  on  the  right  side  of  Eq.  (4)  can  be  obtained  from  experi- 
ment, and  this  gives  a measure  of  the  stress  per  unit  charge  in  the  surface. 

The  result  of  combining  the  measurements  of  Bg  and  B with  the  constants  s and  S 
characteristic  of  various  substrates  and  metals  yields  the  values  of  Xg/q  shown  in 
Table  I. 

TABLE  I.  Surface  stresses  Xg/q  of  metal-dielectric  interfaces. 

(Units  of  (N/m)/(C/m^)) 


Substrate: 

Mica 

Mylar 

Glass 

Kapton 

s[in  (C/m^) 

2] 

.054 

sO 

00 

2.6 

. 77 

Metal 

S(10' 

1 1 m/N) 

Ag 

1 

. 28 

4.  3 

-0.35 

1 

o 

O 

-2.  3 

Au 

1 

. 28 

-1.3 

2.  3 

6.  1 

Ag/Au 

1 

. 3 

-0.074 

Cu 

0 

.87 

18 

Ai 

1 

.02 

1.3 

Sb 

1 

.01 

7.9 

NiFe  90- 10 

0 

.54 

2.0 

80-20 

0 

.54 

1.5 

The  central  result  of  this  table  is  that  all  surface  stresses  obtained  in  this  man- 
ner are  close  to  the  order  of  magnitude  ( - 1(  N/m)(C/m^)  predicted  by  theoretical 
4 

estimates.  In  more  detail,  however,  there  are  significant  and  surprising  variations 
of  this  stress  between  different  metal/substrate  combinations,  ranging  over  more  than 
one  power  of  ten,  and  showing  either  sign. 

At  this  time  there  exists  no  theory  with  which  to  compare  these  results  for  dif- 
ferent metals  and  substrates,  but  they  appear  to  offer  sufficient  variety  to  indicate  that 
the  method  proposed  here  can  be  used  to  study  interface  stresses  caused  by  charging  in 
quite  some  detail. 

For  example,  the  result  for  the  Ag-Au  alloy  falls  between  the  values  for  pure  Ag 
and  Au,  and  indicates  that  there  exists  one  composition  at  which  the  charge-induced 
surface  stresses  at  the  mica  interface  vanish,  at  least  at  room  temperature.  In  terms 
of  a model  of  surface  piezoelectricity,  this  implies  that  this  particular  interface  retains 
its  centrosymmetric  character  so  that  piezoelectric  effects  have  to  be  absent.  Similarly, 
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the  large  stress  value  for  the  Cu/mica  interface  implies  a particularly  strong  response 
to  electrostatic  charging,  which  is  of  interest  whenever  copper  is  used  as  a shield. 
Finally,  the  great  variation  observed  between  different  metal/substrate  combinations 
appears  to  make  it  impossible  to  extrapolate  the  data  to  other  combinations. 

It  should  be  kept  in  mind,  however,  that  while  the  surface  layer  thickness  t does 

s 

not  appear  explicitly  in  the  final  result,  Eq.  (4),  its  magnitude  is  nevertheless  pertinent 
to  the  reasonableness  of  the  above  analysis.  Measurements  in  the  Au-Ag/mica  system, 
using  a very  thin  layer  of  Ag,  indicate  the  thickness  of  penetration  of  the  strains  to  be 
of  the  order  of  25  K.  Hence  the  model  of  a two-parallel-layers  system  applies  strictly 
only  to  those  metals  whose  carriers  have  a mean  free  path  of  order  t or  less,  i.e., 
the  two  Ni-Fe  alloy  compositions  of  Table  I.  In  all  other  cases,  conduction  in  the  sur- 
face region  has  to  be  treated  on  a more  microscopic  basis,  but  this  should  primarily 
affect  the  equality  of  the  ratio  y^/y.  Since  the  range  of  y's  encountered  among  widely 
differing  metals  is  relatively  small,  such  a deviation  should  not  be  large  enough  to  up- 
set the  relative  magnitudes  of  the  deduced  stresses  shown  in  Table  I. 
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ELASTORESISTANCE  AND  ELECTRON  TUNNELING  IN  OXIDIZED  IRON 
T.  Pignataro,  H.J.  Juretschke  and  M.  Eschwei 

As  part  of  a study  of  the  microwave  properties  of  metal  surfaces  covered  with  in- 
homogeneous oxides,  we  have  attempted  to  build  prototype  structures  of  two  metallic 
regions  connected  by  an  oxide  of  the  kind  likely  to  be  found  under  natural  oxide  growth 
Conditions,  and  to  measure  their  room- temperature  electrical  properties.  We  chose 
iron  as  the  metal,  because  it  and  its  oxides  are  an  often- encountered  practical  metal- 
lic surface,  even  though  its  conventional  MOM  junctions,  obtained  by  thin  film  overlays, 
are  largely  ohmic  at  room  temperature.*  This  is  attributed  to  the  significant  conductiv- 
ity of  the  pertinent  oxides,  or  to  a very  low  barrier  at  the  oxide  interface.  We  have 
therefore  concentrated  on  different  MOM  geometries. 

The  most  successful  structure  has  been  obtained  using  the  configuration  shown  in 
Figure  1.  An  iron  film  of  about  1000  X is  deposited  on  a substrate,  and  a thin  line 


Fig.  1.  Method  of  preparing  thin-line  junctions  of  FeOx  between  two 
iron  films,  (a)  Thin- line  deposit  in  the  shadow  of  a Wire 
mask;  (b)  Schematic  oxide  growth  in  the  thin- line  region. 

across  the  film  is  produced  by  shadowing  the  deposition  with  a 10  *cm  diameter  wire 
stretched  about  10  * cm  above  the  substrate.  This  line,  shown  in  cross  section  in 
Fig.  1(a),  is  optically  transparent,  but  electrically  continuous,  and  of  very  low  resist- 
ance. If  the  sample  is  then  exposed  to  a wet  atmosphere  at  elevated  temperature,  an 
oxide  grows  all  along  film  surface,  and  forms  a layer  of  the  form  shown  schematically 
in  Figure  1(b).  This  geometry  produces  an  Fe-FeO^-Fe  junction  of  a cross  sectional 
area  significantly  smaller  than  obtained  in  the  usual  film  overlay  structure.  It  there- 
fore tends  to  magnify  the  influence  of  the  junction  in  transport  along  the  film. 

The  room  temperature  electrical  measurements  on  samples  prepared  in  this  man- 
ner show  three  important  changes  in  properties  relative  to  those  of  the  initial  configura- 
tion of  Fig.  1(a): 
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(1)  The  resistance  increases  significantly  faster  with  oxidation  than  expected  by 
the  uniform  thinning  of  the  sample  because  of  the  oxide  skin. 

(2)  The  I-V  characteristic  becomes  nonlinear,  with  V(-I)=-V(I),  at  current 
values  that  become  progressively  smaller  as  the  sample  resistance  increases. 

(3)  The  strain  sensitivity  of  resistance  (l/R)(dR/de)  increases  with  increasing 

R,  and  also  shows  a nonlinearity,  becoming  smaller  as  the  current  increases. 

A typical  I-V  characteristic  is  shown  in  Fig.  2,  for  a sample  containing  three 

V 

2 


I 


30  60  m* 

Fig.  2.  I-V  characteristic  of  a lightly  oxidized  sample  having 
three  thin-line  junctions  in  series. 

lightly  oxidized  thin-line  junctions  in  series.  Its  initial  shape  has  the  form  expected 
for  current  tunneling  through  a barrier.  At  higher  currents,  heating  of  the  sample 
causes  deviations  from  the  ideal  exponential  tunneling  behavior.  Also,  some  samples 
show  a slight  opening  of  the  I-V  loop,  suggesting  that  some  type  of  switching  between 
different  current  states,  such  as  is  found  in  amorphous  materials,  may  play  a role  in 
conduction  through  the  barrier. 

The  elastoresistance  response  of  a sample  similar  to  that  of  Fig.  2 is  summariz- 
ed in  Table  I below.  Clearly,  the  presence  of  the  junctions  enhances  the  effect  of  strain, 
in  all  cases.  The  pronounced  decrease  in  the  strain  gauge  factor  (l/R)(dR/de)  with 
large  currents  is  in  qualitative  agreement  with  the  behavior  expected  from  tunneling 
junctions.  However,  the  change  in  angular  dependence  of  the  gauge  factor  with  increased 
currents,  indicating  an  appreciable  anisotropy  in  the  elastic  behavior  of  the  junction,  is 
not  yet  understood. 

The  I-V  characteristic  of  the  simplest  model  of  a square  barrier  junction  is  given 
at  low  currents  by^ 
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TABLE  I.  Strain  gause  factor  (l/R)(dR/de)  for  oxidized  thin  iron 
films:  (a)  of  uniform  thickness,  (b)  containing  three 
thin-line  junctions  in  series.  8 is  the  angle  of  the  ap- 
plied strain  relative  to  the  current. 


e 

(a) 

5 mA 

(b) 

10  mA 

15  mA 

0° 

4.0 

11.3 

8.3 

6.0 

45° 

2.6 

9.5 

7.5 

6.  1 

sO 

O 

0 

1.3 

8.8 

7.5 

6.2 

I/V  = Ae*Pt+  B(1  + pt/4)e*^tV2  (1) 

where  A,  B and  p are  constants  depending  on  the  work  function  <f>  of  the  barrier,  and  t 
is  the  barrier  thickness.  This  equation  predicts  a gauge  factor  for  the  junction,  at 
constant  I,  of  the  form 

(l/V)(dV/de)  = pt[l  - — ~(9/4  + l/2pt)V2]  (2) 

1 24  <(> 

This  result  contains  the  observed  decrease  with  increasing  I (or  V).  It  predicts 
a primarily  linear  growth  of  the  gauge  factor  with  barrier  thickness,  and  it  allows 
estimating  an  absolute  magnitude  of  the  strain  sensitivity  of  a junction. 

For  a Fe-FeO^-Fe  junction,  <j>  is  expected  to  be  below  1 eV,  and  then  p is  a 
fraction  of  1 Hence  a barrier  with  t - 5 A would  be  compatible  with  the  results  of 
Table  I.  In  fact,  it  is  very  likely  that  p is  much  smaller  and  t much  larger  than  this 
first  estimate.  At  this  point  the  most  important  conclusion  is  that  the  barrier  model 
gives  magnitudes  and  trends  of  the  correct  order. 

The  prediction  that  the  strain  sensitivity  is  enhanced  appreciably  by  thicker  bar- 
riers suggests  that  such  junctions  on  the  surfaces  of  oxidized  iron  and  other  metals  may 
contribute  to  the  relatively  large  strain  dependence  of  the  surface  impedance  observed 
on  such  surfaces? 
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NEW  SOLID  STATE  MATERIALS 

E.  Banks,  S.  Nakajima,  R.  Sacks  and  M.  Shone 

Research  in  the  area  of  new  materials  is  devoted  to  synthesis,  crystal  growth  and 
characterization  of  materials  having  interesting  electrical,  optical  and  magnetic  prop- 
erties which  may  provide  a basis  for  new  optical,  electro- optic  and  magneto-optic  de- 
vices. Section  A describes  the  structural  and  luminescence  properties  of  the  new 
complex  fluorides  of  divalent  Sm  and  Eu  with  Mg,  their  behavior  in  dilution  with 
SrMgF^,  and  the  characterization  of  the  striking  energy  transfer  from  Eu^+  to  Sm^+in 
solid  solutions  of  SmMgF^  in  EuMgF^.  Section  B gives  the  result  of  a single  crystal 
X-ray  analysis  of  a tetragonal  mixed  fluoride  of  composition  KQ  ^Mn^  5Feo  54*3 • 
Section  C discusses  the  properties  of  a new  series  of  solid  solutions  of  LiInP?07  in  . 
cubic  ZrP^O^  in  which  up  to  20  mole  of  Li  ions  have  been  substituted,  making  the  ma- 
terial a promising  candidate  for  a solid  electrolyte  in  batteries  based  on  Li  metal,  for 
applications  requiring  high  energy  density. 

A.  Fluorescence  in  New  Divalent  Rare  Earth  Magnesium  Fluorides 

Last  year,  we  reported  on  the  synthesis  and  some  luminescent  properties  of  new- 
ly discovered  ternary  fluorides  of  composition  EuMgF^,  SmMgF^  and  SrMgF^,  and  some 
of  the  solid  solutions  formed  among  them.  The  unit  cells  of  all  three  compounds  are 
orthorhombic,  similar  to  the  unit  cells  of  BaMnF^*  and  BaMgF^,  the  latter  two  being 
piezoelectric  at  room  temperature  and  showing  non-linear  optical  behavior  such  as 
frequency  doubling.  An  apparent  anomaly  in  the  unit  cell  parameters  as  compared  to 
those  of  BaMgF^  has  been  resolved  by  a more  careful  analysis  of  the  X-ray  data  on  our 
materials.  The  unit  cells  are  compared  below; 


a(A) 

. b(A) 

c(A) 

BaMgF^ 

5.81 

14.509 

4.  125 

SmMgF4 

5.661 

14.440 

3.965 

EuMgF^ 

5.658 

14.430 

3.933 

SrMgF4 

5.637 

14.459 

3.917 

In  contrast  to  the  barium  compounds,  which  melt  congruently,  and  thus  can  be 
obtained  as  sizeable  single  crystals,  all  three  of  these  new  compounds  decompose  on 
melting  (near  900°C)  into  the  binary  fluorides.  We  had  previously  reported  difficulty 
in  preparing  the  Sm  and  Eu  compounds  directly  from  the  difluorides,  necessitating  the 
use  of  Mg  vapor  on  mixtures  of  the  trifluorides  and  MgF^.  This  has  now  been  found  to 
be  a result  of  the  presence  of  some  oxide  in  the  starting  materials,  which  was  eliminated 
by  preheating  the  starting  materials  (EuF^  and  SmF^)  with  ammonium  fluoride.  At- 
tempts to  grow  single  crystals  have  been  unsuccessful  thus  far  because  of  the  incongruent 
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melting  behavior.  Flux  growth  experiments,  using  molten  MgCI^  and  SrCl^,,  are  now 
being  set  up  for  SrMgF^  as  a model  compound  and  host  crystal  for  EuMgF^  and  SmMgF^. 
In  the  absence  of  single  crystals,  we  have  been  attempting  to  elucidate  the  crystal  struc- 
ture from  powder  X-ray  data.  Assuming  the  published  structure^  of  BaMnF^,  structure 
factor  calculations  on  EuMgFj  were  carried  out  and  compared  with  those  derived  from 
the  observed  powder  X-ray  intensities.  While  satisfactory  agreement  was  not  obtained, 
the  results  suggested  that  the  structures  are  related.  A test  for  second  harmonic  gen- 
eration was  negative,  indicating  that  EuMgF^  may  be  centrosymmetric  and  suggesting 
the  possibility  that  the  structure  may  be  that  of  the  hypothetical  high- temperature  form 

of  BaMgF.  and  that  EuMgF.  may  undergo  a low- temperature  transition  to  a ferroelectric 

4 4 151 

phase.  We  are  now  setting  up  the  measurement  of  the  Eu  Mossbauer  spectrum 

which  may  yield  information  about  the  existence  of  such  a transition.  A large  sample 

of  SrMgF.  is  being  prepared  for  a study  of  the  structure  by  neutron  diffraction,  using 

4 3 

the  new  method  of  line  profile  analysis  for  structure  refinement  on  polycrystalline 
samples.  Attempts  to  prepare  thin  films  of  EuMgF^  by  vacuum  evaporation  failed  be- 
cause of  the  decomposition  of  the  samples  in  the  evaporation  vessel. 

The  luminescence  properties  of  these  new  compounds  are  quite  unusual.  The 
compound  EuMgF^  displays  a bright  blue  fluorescence  under  365  nm  excitation.  The 
blue  emission  occurs  in  a broad  blue  band  peaking  at  437  nm.  The  excitation  spectrum 
is  also  a broad  band  with  a maximum  at  350  nm.  In  solid  solution  with  SrMgF^,  the 
maximum  efficiency  is  found  at  75  mole  % EuMgF^,  but  the  efficiency  of  pure  EuMgF^ 
is  only  about  5%  (relative)  less  than  that  of  the  brightest  solid  solution.  This  indicates 
the  near-absence  of  concentration  quenching  in  this  material.  In  contrast,  the  bright- 
ness of  solid  solutions  of  SmMgF^  in  Sr  MgF^  is  a maximum  at  10  mole  % SmMgF^,  in- 
dicating substantial  concentration  quenching.  This  behavior  of  EuMgF^  places  the 
material  in  the  class  of  "stoichiometric"  phosphors  where  the  active  luminescent 

species  can  be  present  in  very  high  concentration  without  reducing  the  luminescent  ef- 

4-  6 7 8 

ficiency.  Examples  of  such  phosphors  are  NdP^O^  and  LiNdP^O^.  ’ The  latter 
materials  have  been  made  into  miniaturized  solid  state  lasers,  emitting  in  the  near 
infrared  (1.06^tm).  The  highly  efficient  broadband  blue  emission  of  EuMgF^  and  its 
solid  solutions  with  SrMgF^  and  BaMgF^  suggests  that  lasers  tunable  in  the  blue  region 
of  the  spectrum  can  be  made  from  single  crystals  of  these  materials,  by  controlling 
the  optical  feedback  under  flashlamp  excitation. 

Pure  SmMgF^  fluoresces  a weak  red  under  365  nm  excitation,  but  its  excitation 
maximum  is  near  470  nm;  the  emission  spectrum  is  a series  of  sharp  lines  in  the  red 
and  near  IR,  with  the  strongest  peak  near  680  nm.  In  solid  solution  in  SrMgF^,  maxi- 
mum efficiency  is  found  at  10  mole  % SmMgF^,  indicating  substantial  concentration 

I 
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quenching,  as  stated  above.  In  solid  solutions  of  composition  EUj  ^Sm^MgF^,  the  flu- 
orescence under  365  nm  excitation  shows  no  Eu^+  emission  for  Sm  concentrations  above 
10  mole  %,  and  very  little  Eu  emission  at  1 mole  % Sm,  even  though  excitation  is  near 
the  maximum  for  Eu2+.  This  indicates  very  efficient  energy  transfer  from  excited 
Eu^+  centers  to  Sm^+  centers.  The  complete  absence  of  the  Eu*"  + spectrum  in  samples 
containing  more  than  1%  Sm  suggests  that  the  major  mechanism  of  energy  transfer  is 
non- radiative  (resonance  transfer).  Measurements  of  the  lifetime  of  the  excited  state 

of  the  Eu^+  have  confirmed  that  this  is  occurring.  The  excited  state  lifetime  in  pure 

- 7 - 9 2 + 

EuMgF^  is  about  10  sec.  This  is  reduced  to  10  sec  (for  the  Eu  emission)  when 

1%  SmMgF.  is  in  solid  solution.  This  substantial  shortening  of  the  fluorescent  lifetime 

4 2+2  + 
is  strong  evidence  for  non- radiative  energy  transfer  from  Eu  to  Sm  . An  experi- 
ment in  which  a mixed  powder  of  EuMgF  and  SmMgF,  was  irradiated  with  365  nm 

**  2 + **  2 + 

radiation  also  showed  excitation  of  the  Sm  red  emission,  but  the  Eu  emission  was 

also  present.  This  indicates  that  radiative  energy  transfer  is  possible,  but  does  not 

2 + 

play  a major  role  in  the  solid  solution  materials,  which  do  not  show  the  Eu  emission. 

This  indicates  that  the  non- radiative  transfer  rate  is  very  rapid  compared  to  the  excited 

2 + 

state  lifetime  of  the  Eu 

B.  Mixed  Alkali  Transition  Metal  Fluorides 

Work  in  this  area  has  been  deferred  in  favor  of  the  research  discussed  above.  A 

crystal  structure  analysis  has  been  completed  on  one  member  of  the  series 

K Mn  Fe,  F,,  nominally  K_  rMn.  cFe.  -F,,  whose  composition  was  determined 
x x 1-x  3 0.5  0.5  0.5  3 r 

c rystallographically  to  be  KQ  54MnQ  5FeQ +04Fe0 +5 F3 ' The  results  are  summarized 
as  follows:  A specimen  of  flux-grown  KQ  54(Mn,Fe)F^  was  studied  at  room  temperature 
by  single  crystal  X-ray  diffractometry . The  crystal  has  a structure  of  the  tetragonal 
tungsten  bronze  type  with  a doubled  c^  axis . The  unit  c ell  parameters  are  a = 12.765(1)X, 
c =8.002(1)  it,  and  the  space  group  is  P^bc  . Least- squares  refinement  was  carried 
out  with  2395  symmetry- independent  reflections  collected  with  an  automatic  diffracto- 
meter  (Mo-Ka  radiation).  The  final  Rvalue  (£  |Fq  - kF^  |/2|Fo  |)  is  0.049,  with 
anisotropic  thermal  parameters.  In  the  structure,  potassium  atoms  fully  occupy  penta- 
gonal (CN  =15)  sites  and  partially  occupy  tetragonal  (CN  =12)  sites  . The  transition 
metal  ions  occupy  three  different  kinds  of  octahedra.  The  mean  M-F  distances  in  each 
octahedron  are:  2.  100  A for  M(  1 )- F,  1 . 939  A for  M(2)- F,  and  1.995  A for  M(3)-F. 

These  distances  indicate  that  the  M(l)  sites  are  mainly  occupied  by  bivalent  ions,  chiei- 
ly  Mn^+,  the  M(2)  sites  by  Fe^+  and  the  M(3)  sites  by  Mn^+  and  Fe'?+. 
o 

A paper  on  this  work  has  been  submitted  to  Acta  Crystallographies . 
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C.  Potential  New  Solid  Electrolytes 

We  have  continued  in  the  attempt  to  introduce  large  concentrations  of  alkali  metals 
into  cubic  ZrP^O^,  whose  structure  contains  large  interconnected  cavities  making  it  a 
candidate  for  application  as  a solid  electrolyte  in  batteries  based  on  alkali  metals. 

Previously  we  had  reported  the  introduction  of  up  to  10  mole  % Li+  ions,  compensated 
3+  4 + 

by  Y replacing  Zr  ions.  Recently  we  have  succeeded  in  introducing  up  to  20  mole 
% Li  into  this  compound.  This  was  accomplished  by  using  indium  as  the  charge-com- 
pensating ion.  Compositions  formulated  as  Zr . Li  In  P->07  were  prepared.  Samples 
with  values  of  x from  0 to  0.2  show  only  the  pattern  of  cubic  ZrP^O^,  with  line  shifts 
indicating  solid  solution  formation.  At  higher  concentrations  (x  values)  a new  phase 
appears,  apparently  LilnP^O^.  This  composition  appears  to  be  a previously  unreported 
compound.  The  powder  diffraction  pattern  appears  to  be  orthorhombic  from  a prelim- 
inary study.  We  are  now  preparing  larger  samples  for  study  by  nuclear  magnetic 
resonance  to  determine  whether  the  lithium  ions  are  mobile,  and  to  determine  whether 
ceramic  samples  can  be  prepared  for  conductivity  measurements. 

Continuation  of  work  in  this  area  has  been  funded  for  two  years  by  a grant  from 
the  Army  Research  Office. 

Joint  Services  Technical  Advisory  Committee 

F 44620-74-  C-0056  E.  Banks 

U.S.  Army  Research  Office 
DA  AG-  29- 75  - C- 0096 
DAAG-  29-  78  - G- 0105 
DAAG-29-78-G-0150 
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ULTRASONIC  AND  MICROWAVE  DIELECTRIC  RELAXATION  OF  LIQUID  DIALKYL 
CARBONATES 

D.  Saar,  J.  Brauner,  H.  Farber  and  S.  Petrucci 

Ultrasonic  absorption  data  for  pure  dimethyl,  diethyl,  dipropyl,  dibutyl  and  pro- 
pylene carbonates  in  the  frequency  range  of  3- 300  MHz  and  in  the  temperature  range  of 
25- 55°C  are  reported.  An  ultrasonic  relaxation  process  of  the  Debye  type  with  a re- 
laxation frequency  fR  = 8 ^-lOMHz  at  25°C,  which  is  largely  independent  of  the  length 
of  the  alkyl  chain  in  the  alkoxy  group,  has  been  found  for  the  non-cyclic  carbonates. 

No  such  process  has  been  observed  for  the  cyclic  propylene  carbonate  and  for  dimeth- 
oxymethane  which  lacks  the  carbonyl  group. 

The  observed  relaxation  process  is  interpreted  as  being  due  to  a cis-trans  isom- 
erization of  the  alkoxy  groups.  By  using  Lamb's  approach  for  thermal  relaxation 
processes  both  the  activation  parameters  AH^  and  AS^  for  the  reverse  step  of  the  equi- 
librium and  the  thermodynamic  parameters  AH°  and  K for  the  equilibrium  have  been 
determined . 

Complex  dielectric  permittivities  in  the  microwave  frequency  range  of  0.6-67  GHz 
for  the  above  non-cyclic  carbonates  at  25°C  are  also  reported.  The  data  can  be  de- 
scribed over  almost  all  the  frequency  range  studied  by  a single  Debye  relaxation  process 
The  high  end  of  the  frequency  range  shows  some  positive  deviations  of  the  loss  in  ac- 
cord with  literature  data.  The  dielectric  relaxation  process  is  interpreted  as  due  to  a 
segmental  motion  of  the  molecule  under  the  influence  of  the  electric  field.  The  different 
nature  of  the  relaxation  mechanisms  for  the  ultrasonic  and  dielectric  processes  is 
attributed  to  the  different  type  of  perturbing  function  inherent  in  the  two  methods. 

A.  Introduction 

Mechanical  waves  of  ultrasonic  radio  frequencies  have  been  used  in  the  past  to 
investigate  the  dynamics  of  molecular  relaxation  in  pure  liquids.1  Similarly,  electro- 
magnetic waves  at  microwave  frequencies  have  been  employed  to  study  the  decay  of 
polarization  in  these  same  substances  due  to  the  relaxation  of  the  orientation  of  dipolar 
molecular  groups.^  Very  rarely  have  both  types  of  waves  been  employed  by  the  same 
research  group  to  investigate  the  molecular  dynamics  of  organic  liquids,  the  most 
common  situation  being  mutual  ignorance  of  the  information  obtainable  by  the  alternate 
tool.  Because  of  the  different  natures  of  the  perturbing  waves  complementary  informa- 
tion of  great  value  can  be  obtained  by  the  use  of  both  techniques.  The  compressions 
and  rarefactions  caused  by  the  mechanical  waves  can  disturb  a pressure  sensitive  equi- 
librium which  has  a AV  (=  AV  - 0AH/oCp)  / 0 as  indicated  by  the  well-known  relation 

S 1 A 


(3fnK/3p)  =-AV/RT.3,4 
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Thus,  the  concentration  of  a molecular  form  which  is  small  relative  to  some 
other  form  at  low  pressure  may  be  substantially  increased  at  high  pressure  if  its  molar 
volume  is  smaller  or  its  enthalpy  is  higher  than  that  of  the  other  form. 

An  applied  electric  field,  for  example,  in  the  form  of  an  electromagnetic  wave, 

may  also  cause  a shift  in  the  equilibrium  between  two  molecular  forms.  If  the  two 

forms  have  a different  dipole  moment  the  equilibrium  constant  relating  them  can  be 

changed  according  to  the  relation  (3  f nK/3  E)  = AM/RT,  where  AM  is  the  change  in  the 

4 

dipole  moment  per  mole. 

Alternatively,  if  a given  molecular  form  with  a non- zero  dipole  moment  (because 
of  the  presence  of  one  or  more  polar  groups)  always  predominates,  a dielectric  relaxa- 
tion process  associated  with  the  orientations  (following  the  alternating  field)  of  one  or 
more  of  these  polar  groups  will  be  present.  In  this  latter  case  the  dielectric  relaxation 

sje 

process  may  not  be  related  to  the  ultrasonic  relaxation  process. 

Dialkyl  carbonates  were  chosen  as  the  starting  point  because  of  their  relation  to 
esters  which  contain  one  of  the  two  alkoxy  groups  present  in  the  carbonates  and  because 
of  the  possibility  that  these  molecules  might  be  considered  model  monomers  for  the 
corresponding  polycarbonate  polymers. 

B.  Experimental 

1 . Apparatus 

The  ultrasonic  equipment  and  procedures  have  been  described  elsewhere/’  A 
linear  least  squares  method  was  applied  to  all  the  absorption  (dB)  data  versus  distance 
at  each  frequency.  This  improved  the  precision  of  the  calculated  absorption  coefficients 
and  gave  correlation  coefficients  better  than  r =0.99  in  all  instances.  The  dielectric 
equipment  and  procedures  have  been  described  elsewhere.^ 

Cannon  viscometers  No.  0 and  No.  1 (Cannon,  University  Park,  Pennsylvania) 
with  manufacturer  calibration  certificates  were  used.  Two  50ml  pycnometers  calibrated 
with  distilled  water  at  25 . 0°C  were  also  used.  Thermostatting  of  the  ultrasonic  and 
dielectric  cells  and  of  the  viscometers  and  pycnometers  was  with  i 0.05°C. 

2.  Materials 

Diethyl-,  dipropyl-  and  dibutylcarbonates  (Eastman  Kodak)  were  vacuum  distilled 
in  an  all  glass  apparatus  and  used  shortly  thereafter.  Dimethoxymethane  (lab  stock) 
was  distilled  at  atmospheric  pressure.  Propylene  carbonate  (Eastman  Kodak)  was  used 


Associated  with  the  dipolar  orientation  of  the  predominant  species. 
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■without  further  purification,  the  result  being  only  of  a qualitative  nature. 

C.  Results  and  Calculations 
1 . Ultrasonic  Relaxation 

Figure  1 shows  the  results  expressed  as  a/ f2  versus  the  frequency  f (MHz)  for 
dimethyl  carbonate  and  diethyl  carbonate  at  the  various  temperatures  investigated,  a 
is  the  sound  absorption  coefficient  (neper  cm  ^).  The  solid  lines  represent  values 


given  by  a Debye  type  function  for  a single  relaxation  process: 
a/i 2 = (A/1  + (f/fR)2)  + B 


3,4 


(1) 


Figure  2 shows  similar  plots  for  the  other  two  carbonates  investigated.  Table  I 
collects  the  quantities  A,  B,fD  and  the  sound  velocities  for  these  systems.  The  sound 
velocities  are  linear  functions  of  temperature  expres sable  by  the  equations  calculated 
by  linear  regression: 


dimethyl  carbonate: 
diethyl  carbonate  : 
dipropyl  carbonate: 
dibutyl  carbonate  : 


u = 1 196-  3 . 67(t- 25)  m/s  r = 0.996 
u = 1179-3. 47(t-25)  m/s  r2  = 0.985 

0.995 
0.988 


u = 1 122- 4.  23(t- 25)  m/s  r* 
u = 1253-  3. 20(t-25)  m/s  r' 


where  r is  the  correlation  coefficient. 

- 1 2 

TABLE  I.  Ultrasonic  parameters  A,  B (cm  sec  ) and  f^  (MHz) 
according  to  Eq.  (1)  for  the  liquid  carbonates  investi- 
gated at  25,  40  and  55°C.  Corresponding  sound  veloc- 
ities (m/sec)  for  the  same  systems  and  temperatures. 


liquid 

B 

AxlO17 

-1  2 
cm  sec 

17 

Bx  10 

- 1 2 
cm  sec 

fR 

MHz 

u 

meter  sec  * 

Dimethyl  carbonate 

25 

4500 

56 

8.5 

1 198 

40 

29  30 

70 

15 

1137 

55 

2230 

70 

22 

1088 

Diethyl  carbonate 

25 

2357 

43 

9.5 

1 176 

40 

1570 

30 

17 

55 

1100 

50 

30 

Dipropyl  carbonate 

25 

1600 

50 

10 

40 

1140 

60 

15 

ll6-i 

55 

850 

60 

24 

1092 

Dibutyl  carbonate 

25 

1540 

60 

8.5 

1256 

40 

1100 

70 

12 

1199 

55 

750 

J 

60 

20 

1160 
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Figure  3 shows  that  for  propylene  carbonate  which  has  its  alkoxy  groups  held  in 
a ring  structure,  no  relaxation  is  visible  at  25°C,  whereas  for  all  the  other  liquids  at 

Q 

25  C a relaxation  process  centered  around  8. 5- 10  MHz  is  apparent  (Table  I). 


Fig.  3.  Ultrasonic  absorption  expressed  as  a/i  vs.  the 
frequency  f for  propylene  carbonate  and  dimeth- 
oxymethane  at  25°C. 


Included  in  Fig.  3 is  the  ultrasonic  data  at  25°C  for  dimethoxymethane  expressed 
as  o/f2  versus  f.  The  last  molecule  has  no  carbonyl  group  but  rather  a methylene 
group  to  which  the  alkoxy  groups  are  attached,  and  again  no  relaxation  process  is  evi- 
dent for  this  molecule. 

2.  Dielectric  Relaxation 

Figure  4 displays  the  quantities  e'  and  e",  the  real  and  imaginary  parts  of  the 
complex  permittivity,  plotted  versus  f (GHz)  for  liquid  diethyl,  dipropyl,  and  dibutyl 
carbonates  at  25°C.  The  data  for  dimethyl  carbonate  has  been  reported  previously.^ 

$ 

Shown  here  for  comparison  sake. 


teal  part  s'  and  coefficient  of  the  imaginary  part  c"  of  the  complex  permittivity 
dotted  vs.  the  frequency  f (GHz)  for  dimethyl,  diethyl,  dipropyl  and  dibutyl 
arbonates  at  25°C.  The  solid  lines  are  calculated  functions  according  to  singl 
)ebye  relaxation  function. 
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Figure  5 gives  the  Cole-Cole  plots  of  e"  versus  e1  for  the  investigated  liquids.  The  solid 
lines  represent  the  values  given  by  Debye  type  functions  for  a single  relaxation  process^ 

(2) 

E"=  (£o  * £oo)(f/fR)/( 1 +<f/fR)2) 


In  Fig.  4 the  data  for  diethyl  carbonate  show  a deviation  from  a simple  Debye  process 

8 

at  high  frequencies  {>35  GHz).  Recent  literature  data  for  dimethyl  carbonate  and 
diethyl  carbonate  show  positive  deviations  from  the  Cole- Cole  locus  at  high  frequencies. 
This  deviation  has  been  attributed  to  a non- Debye  absorption  process  in  the  far  IR  at 
millimeter  wavelengths.  By  non-Debye  is  meant  not  associated  with  dipolar  orienta- 
tional relaxation.  We  will  purposely  neglect  this  contribution  to  the  total  loss  e thus 
the  extrapolated  quantity  retains  the  formal  significance  of  the  real  part  of  the  com- 
plex permittivity  just  after  the  dipolar  orientation  polarization  has  relaxed.  Apparent 
relaxation  times  were  thus  calculated  from  the  one  term  Debye- type  relaxation  func- 
tions Eq.  (2)  and  will  be  compared  among  themselves  for  the  liquids  investigated. 

Table  II  presents  the  values  of  £ , c and  f„  for  these  liquids. 

o » R 


TABLE  II.  Apparent  relaxation  parameters  e0,  too.fR,  microscopic 
relaxation  time  Tm,  shear  viscosity  q , density  p,  molar 
volume  V,  molecular  volume  v for  the  dialkyl  carbonates 
studied  in  this  work  at  t = 25°C. 


Liquid 

H 

E» 

fR 

T x 10  1 2 
m 

n 

* 

P 

V 

vx  1022 

carbonate 

■ 

(GHz) 

sec 

cpoise 

gr/cm3 

cm3/mole 

cm3/mole 

Dimethyl^ 

3.  12 

2.  35 

22 

6.64 

0.585 

m 

84.739 

1.408 

Diethyl 

2.84 

2.  31 

16 

9.33 

0.  750 

DBS 

121. 92 

2.025 

Dipropyl 

2.73 

2.  29 

10 

15.06 

1. 243 

156-0g 

2.593 

Dibutyl 

2.60 

2.  25 

8 

19.0 

o 

1.717 

BH 

189. 49 

3.  148 

The  dependence  of  density  on  temperature  for  the  last  three  liquids, 
in  the  temperature  range  25-55°C,  can  be  expressed  by  the  relations 
determined  by  linear  regressions. 

Diethyl  carbonate:  p = 0 . 9969  - 1 . 1 1 x 10*  3t,  °C  r2  = 0.999 
Dipropyl  carbonate:  p = 0 . 9586  - 0 . 90?  x l(f  3t,  °C  r‘  = 0.993 
Dibutyl  carbonate:  p = 0 . 9439  - 0 . 97§  x 10*  3t,  °C  r2  = 0.999 
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Fig.  5.  Cole- Cole  plots  ol  the  quantity  e"  vs,  t1  for 
dimethyl,  diethyl,  dipropyl  and  dibutyl  car- 
bonates at  25°C. 

D,  Discussion 

1 . Ultrasonic  Relaxation 

Of  the  liquid  carbonates  investigated  only  propylene  carbonate  has  its  alkoxy  groups 
largely  immobilized  by  a ring  structure;  only  for  this  carbonate  is  a relaxation  process 
absent.  Therefore  the  hypothesis  is  advanced  that  the  molecular  mechanism  of  the 
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ultrasonic  relaxation  process  involves  a cis-trans  isomerization  of  the  alkoxy  groups. 
Specifically,  we  propose  an  equilibrium  between  the  two  possible  structures 


R 
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/ 

0 

\ 

R 

cis-cis 


9 

which  is  similar  to  that  postulated  by  previous  workers  for  the  case  of  the  alkyl  esters 
where  one  of  the  alkoxy  groups  of  the  carbonates  is  replaced  by  an  alkyl  group.  (A 
concerted  type  of  mechanism  involving  a cis-cis  ^ x trans-trans  equilibrium  is  not 
ruled  out. ) 
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C = 0 


/ 

0 

\ 


cis- trans 


The  existence  of  at  least  two  forms  for  dimethyl  carbonate  has  been  reported  re- 
cently by  Katon  and  Cohen^  based  on  IR  spectra.  They  obtained  a AH  = 2.6  - 0.5 
kcal/mole  for  the  isomerization  equilibrium. 

From  Table  I it  may  be  seen  that  the  relaxation  frequencies  at  25°C  are  of  the 
same  order  of  magnitude  for  the  non- ring  carbonates  investigated.  This  means  that, 
if  the  equilibrium  proposed  above  is  correct,  the  major  contribution  to  the  energy  bar- 
rier of  the  isomerization  process  comes  from  the  rotation  about  the  C-0  bond  (and  not 
from  the  hydrodynamic  resistance  to  the  motion  of  the  alkyl  group  which  goes  from  one 
to  four  carbon  atoms  in  length).  This  at  least  seems  to  be  true  over  the  variation  in 
the  length  of  the  alkyl  chain  studied. 

Another  check  on  the  validity  of  this  argument  was  made  by  studying  dimenthoxy- 
methane.  Here  the  C=0  group  is  replaced  by  a CH^  group  (probably  causing  the  C-0 
bonds  to  become  true  single  bonds).  This  should  cause  more  molecular  flexibility  and 
permit  free  rotation  of  the  alkoxy  groups;  indeed  at  25°C  no  ultrasonic  relaxation 
process  is  visible  over  the  entire  frequency  range  investigated  (Figure  3). 

Given  the  molecular  process 


with  K = kj/k^  one  can  write  the  following  equations:^ 
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AS' 
r 

T*  1 = 2n  f„  = k,  + k = k ( 1 + K)  = ^ e R 
R f r r h 


e RT  {1  + K) 


where  the  symbols  used  have  their  usual  meanings. 

11 


Rearranging 


in(x*l/T)  = [ln£  + 


we  recover: 

/ 

A-AA  A 

h 


AHr  AHr 

r 1 r + in(  1 + K)  . 


] * 


K AH 
K + 1 R 


R 1 RT 

A plot  of  fn(T*  VT)  versus  l/T  yields  a straight  line  of  slope: 

slo?e  = d/d(I/T)T)  = • ir 

and  showing  as  intercept: 

k ASt 

intercept  = fn[^-  + ^ ] 

Plots  of  the  quantity  fn(r  Vt)  versus  l/T  are  shown  in  Figure  6.  The  calculated  values 

of  AS^  are  given  in  Table  III.  To  calculate  AH^  one  needs  to  know  AH°  as  is  obvious 

from  Equation  (4).  This  latter  parameter  is  calculable  from  the  maximum  excess  sound 

absorption  per  wavelength,  p = (A/2)uf  . The  values  appearing  on  the  right  hand 

msx  x\. 

side  of  the  above  equation  have  been  tabulated  in  Table  I. 

TABLE  III.  Activation  parameters  AS^ , AHr  and  thermodynamic 
parameters  AH°  and  equilibrium  constant  K for  the 
isomeric  equilibria  in  liquid  dialkyl  carbonate  studied 
by  ultrasonic  relaxation. 


Liquid 

carbonate 

A? 

r 

e.  u.  (=ca*/mole  deg.) 

mm 

AH° 

Kcal / . 

/mole 

KxlO3 

Dimethyl- 

-4-6 

5 ‘ 5 

2.85 

8-l 

Diethyl- 

" 0 ' 1 1 

6,8 

3-72 

°-19 

Dipropyl- 

*6‘0 

50 

3.02 

6-l 

Dibutyl- 

1 

O 

00 

00 

2.0, 

33‘  8 
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Still  following  Lamb,  for  the  process  A B one  has: 
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AV  i i i 

— - s f—  + — r1 

l r at  + r nw 


max  2 p RT  l[A]  [B] 


introducing  K = [B]/[A]  , and  il  the  concentrations  of  A and  B are  expressed  in  mole 

g 

fraction  units,  [A]  + [ B]  = 1: 


AV 

TT S 


K 


^max  - 2(3S  RT  (1  + R)Z 


(5) 


It  is  now  assumed  that  AV.^,  = 0 (see  Ref.  9)  for  an  isomerization  reaction  so  that 

AV  = -0AH°/pc  . Substituting  this  approximation  for  AV  and  (3  = l/pu^  into  Eq.  (5) 

s p s s 

one  obtains: 
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where  again  for  an  isomerization  reaction  it  has  been  assumed  that  AS  ~ 0 so  that 
AG°  = AH°.  Modifying  slightly  Eq . (6)  one  arrives  at: 


Tu 


t L MI  , AH° 

2 2 RT 

Pc 

P 


■ ah°/rt 


(l*e-AH°/RT,2 


(7) 


The  function  x e /(I  + e X)^,  with  x = AH°/RT,  has  a maximum  at  x = 2.  4.  This  means 
that  for  x > 2.4  or  AH°  > 2.4RT,  the  quantity  Pmax/Tu^  will  increase  by  decreasing  x 
or,  in  other  words,  by  increasing  T.  This  behavior  has  been  verified  for  all  the  car- 
bonates investigated  which  implies  that  AH°  > 1 . 42  kcal/mole  at  T = 298.15°K. 

9 


-x.2 


Now,  in  Eq.  (6)  one  may  approximate  (1  + e 
log  of  both  sides  of  the  equation,  one  may  write: 


AH°/RT. 2 _ , .... 

) = 1 and,  taking  the  natural 


*"(Tp_v/u")  = m[(f  -2-jM 

pc 


AH 


o2 


•)]  - ah°/rt 


(8) 


Neglecting  the  temperature  dependence  of  the  first  term  on  the  right,  one  would  expect 
a plot  of  *n(TRmax/u  ) versus  l/T  to  be  linear  with  a slope  = -AH°/R.  This  is  actually 
the  case  as  seen  from  Figure  6.  The  values  of  AH°  obtained  are  presented  in  Table  III. 
For  dimethyl  carbonate  AH°  = 2 . 8 5 kcal/mole  which  agrees  within  the  experimental 
error  with  the  value  of  2.6  — 0.5  kcal/mole  obtained  by  infrared  spectra.^  The  agree- 
ment is  fairly  remarkable  considering  the  approximations  involved  in  Equation  (8). 

Table  III  also  reports  the  values  of  K = e /RT  ^ may  be  seen  that  K « 1 in  all 
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cases,  showing  the  presence  of  one  of  the  two  molecular  configurations  in  large  excess 
with  respect  to  the  other.  Furthermore,  the  values  of  AH^  are  comparable  for  the 
various  carbonates  studied.  This  indicates  a similar  energy  barrier,  or,  in  other 
words,  the  same  kinetic  process  is  operative  for  all  the  liquids. 


2.  Dielectric  Relaxation 

From  the  relaxation  frequencies  f^  one  may  calculate  relaxation  times  t * = 2xTf. 

which  in  turn  can  be  converted  into  microscopic  relaxation  times  t by  the  Powless- 

12  m 

Glarum  expression  (among  other  options): 


2e  + € 

Q 00 

3c 


r may  be  related  to  a frictional  coefficient  (which  is  associated  to  hydrodynamic  fric- 
m 

tional  forces)  by  the  equation  r = £ /2kT.  For  a spherical  dipole  of  radius  a.,  rotating 

m •} 

in  a medium  with  bulk  viscosity  rj , 4 = 8xra  r)  (see  Ref.  7)  or 


T 


m 


4xx  a' 
kT 
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3v 

= kT  71 


where  v is  the  volume  of  the  spherical  dipole. 

The  values  of  t and  ri  found  in  Table  II  have  been  plotted  in  the  form  of  t ver- 
m m 

sus  r|  in  Figure  7.  If  the  volume  of  the  rotating  dipolar  specie  is  constant  for  the  series 
of  non-cyclic  dialkylcarbonate  studied,  then  the  plot  should  be  linear  with  slope  = 3v/kT. 

A roughly  linear  plot  is  obtained  by  linear  regression  (correlation  coefficient  r =0.987). 
This  corresponds  to  a rotating  volume  v = 1.4o^xl0  cm  orV  = vL  = 8.93  cm  /mole 
(L,  Avogadro's  number).  These  numbers  are  about  one  order  of  magnitude  smaller  than 
the  molar  or  molecular  volume  of  the  carbonates  as  computed  from  the  expressions: 

V = M/p,  v = M/Lp,  where  M is  the  molecular  weight  (in  grams/mole)  and  p the  density 
of  the  corresponding  carbonate  (Table  II). 

One  would  then  be  tempted  to  conclude  from  the  above,  that  does  not  refer  to 
molecular  tumbling  of  the  whole  carbonate  molecule  but  only  of  a part  of  it.  Similar 

13 

conclusions  have  been  reached  by  authors  dealing  with  the  interpretation  of  the  dielec- 
tric relaxation  of  other  compounds  containing  the  alkoxy  group,  namely,  the  esters. 

1 4 

On  the  other  hand,  it  has  been  noted  that  the  application  of  the  Debye  hydro- 
dynamic  model  of  dipole  reorientation  leads  to  unreasonably  small  values  of  the  molecular 
radius.  Modification  of  the  Debye  relationship  by  multiplying  the  bulk  viscosity  by  a 
factor  f of  the  order  of  0.  1-0.2,  according  to  Gierer  and  Wirtz^  gives  radii  of  the  ex- 
pected order  of  magnitude.  Rotational  correlation  times  from  NMR  spectra  have 
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Fig.  7.  Correlation  between  the  inverse  of  the  dielectric  relaxation 
times  t”  ^ and  the  number  of  carbon  atoms  of  the  alkoxy 
chains  for  liquid  carbonates  and  esters.  Inset:  Molecular 
dielectric  relaxation  time  t vs.  the  viscosity  q for  the 
liquid  carbonates  investigated  at  25°C. 

also  been  correlated  to  the  bulk  viscosity  by  McClung  and  Kivelson^  with  the  introduc- 
tion of  a similar  factor. 

There  is,  however,  some  further  evidence  from  Fig.  7 which  would  seem  to  favor 
the  interpretation  of  the  relaxation  process  as  segmental  motion  rather  than  of  whole 
molecule  tumbling. 

In  Fig.  7 is  shown  the  plot  of  the  reciprocal  of  the  dielectric  relaxation  time  t * 
versus  the  number  of  carbon  atoms  in  the  alkyl  chain  of  the  alkoxy  group.  One  sees  a 
decreasing  change  in  t ' as  the  number  of  carbon  atoms  is  increased.  This  is  more 
evident  for  the  esters,  given  the  much  higher  chain  lengths  investigated.^  This  behavior 
suggests  that  the  observed  relaxation  times  refer  to  a segmental  motion  of  the  molecule, 
probably  involving  the  alkoxy  groups.  However,  by  elongating  the  latter  over  a certain 
number  of  carbon  atoms,  the  relaxation  times  tend  to  become  constant  due  to  nonpartici- 
pation in  the  motion  of  carbon  atoms  of  the  alklyl  chain  far  away  from  the  rotating  groups.'^ 
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3 .  Comparison  Between  Ultrasonic  and  Dielectric  Relaxation  Processes 

The  factor  of  1 - 2x  10^  at  T = 298. 2°K  between  ultrasonic  and  dielectric  relaxa- 
tion times  suggests  that  the  processes  studied  by  the  two  methods  correspond  to  two 
different  rate  constants  T * = k,  or,  in  other  words,  to  two  different  molecular  motions. 
The  good  correspondence  between  the  AH°'s  calculated  from  the  ultrasonic  and  the  IR 
spectra  for  the  dimethyl  carbonate  identifies  the  ultrasonic  process  as  a cis-trans 
isomerization.  Ultrasonic  relaxation  frequencies  found  for  the  carboxylic  esters^  are 
similar  to  the  ones  reported  here  for  the  non-cylic  dialkyl  carbonate.  These  ultrasonic 
processes  for  the  esters  have  also  been  ascribed  to  a cis-trans  isomerization.  The 

dielectric  relaxation  frequencies  for  the  esters  differ  from  the  ultrasonic  relaxation 

3 

frequencies  also  by  a factor  of  the  order  of  10  . It  would  seem  difficult,  therefore,  to 
escape  the  conclusion  that  the  description  of  both  the  dielectric  and  ultrasonic  relaxa- 
tions of  esters  and  carbonates  in  terms  of  the  same  cis-trans  isomerization  cannot  be 
correct. 

It  seems  likely  therefore  that  the  dielectric  data  reflect  an  intramolecular  motion 
of  the  predominant  molecular  form  under  the  influence  of  the  electric  field  without 
necessarily  causing,  upon  rotation  of  the  polar  groups,  the  appearance  of  the  energetical- 
ly unfavored  species  which  is  formed  upon  the  passage  of  the  mechanical  waves. 

D.  Saar,  J.  Brauner,  H.  Farber  and  S.  Petrucci 
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DIELECTRIC  RELAXATION  OF  SOME  1:1  ELECTROLYTES  IN  TETRAHYDROFURAN 
AND  DIETHYLCARBONATE 

D.  Saar,  J.  Brauner,  H.  Farber,  S.  Petrucci 

Complex  permittivities  in  the  frequency  range  of  0.3-67  GHz  are  reported  for 
the  systems  LiNO^  0.  1M,  Na-picrate  0.  05M,  Bu^NNO^  0.  1M  in  THF  at  25°C  and  for 
the  systems  LdClO^  0.  1M,  and  LiSCN  0.  1M  in  diethylcarbonate  (DEC)  at  25°C.  The 
permittivities  of  the  electrolyte  solutions  can  be  described  within  the  experimental 
error,  by  the  sum  of  two  Debye  relaxation  processes.  The  high  frequency  process  is 
comparable  to  the  one  observed  for  the  pure  solvents  while  the  low  frequency  process 
is  due  to  the  presence  of  the  electrolytes. 

Apparent  dipole  radii  a^  for  the  electrolytes  in  the  THF  solutions  are  calculated 

from  the  Debye  relation  using  the  macroscopic  viscosity  of  the  solvent.  Charge  to 

charge  separation  distances  a are  calculated  from  the  BSttcher  relation  which  relates 

the  change  in  dielectric  strength  (eo“e0Cj)*  for  a dipolar  relaxation  process,  with  the 

apparent  dipole  moment.  These  two  parameters  a_T  and  a_  determined  for  the  present 

systems  and  two  previous  systems  (LiClO.  and  NaClO  in  THF)  are  linearly  related  to 

2 4 ’ 

each  other  (correlation  coefficient  r = 0.89).  The  a 's  also  correlate  linearly 

2 T 
(r  =0.91)  with  the  sums  of  the  crystallographic  radii.  These  relations  reinforce  our 

previous  conclusions  that  the  solute  relaxation  process  in  THF  is  mainly  due  to  the 

rotation  of  ion-pair  dipoles. 

For  the  DEC  solutions  the  appearance  of  a single  relaxation  process  for  the 
solute  (instead  of  a distribution  of  relaxation  times  as  the  presence  of  two  molecular 
species  would  suggest)  is  discussed  in  terms  of  the  structure  of  the  quadrupoles. 

Recent  Raman  and  IR  spectra  in  the  literature  suggest  a centrosymmetric  structure 
for  (LiSCN)^  in  ethers.  This  structure  if  present  for  the  carbonate  solutions  would 
predict  the  absence  of  a quadrupole  dielectric  relaxation  process  in  accord  with  our 
results  in  dimethylcarbonate  (DMC)  and  DEC. 

A.  Introduction 

In  the  past,  complex  permittivities  of  electrolytes  in  solvents  of  low  permittivity 

have  been  measured  in  order  to  characterize  the  dynamic  behavior  of  the  electrolytes 

under  the  influence  of  microwave  electric  fields  and  to  study  the  relaxation  behavior  of 

the  solute  molecular  species.  The  general  picture  which  had  emerged  from  the  previous 

12  3 

studies  carried  out  in  this  laboratory  ’ ’ was  that  the  solute  dielectric  relaxation 

process  was  mainly  caused  by  the  rotational  relaxation  of  the  ion-pair  dipoles.  These 
species  were  also  present  in  the  largest  concentration.  However,  these  studies  were 
confined  to  some  alkali  perchlorates  in  THF  (Refs.  I,  2)  and  to  lithium  salts  in  DMC 
(Reference  3.) 
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It  was  of  interest  to  try  to  generalize  the  above  conclusions  for  electrolytes  of 
different  nature  and  structure  in  the  same  solvent.  Therefore,  we  have  now  extended 
our  studies  to  include  LiNO^,  Na-Picrate,  and  Bu^NNO^  in  THF  in  the  same  concen- 
tration range  as  used  previously. 

The  study  of  Li+  salts  in  DMC  had  shown  a correlation  between  the  change  in 

the  dielectric  strength  of  the  system  due  to  the  solute  relaxation  process  and 

the  calculated  concentration  of  ion-pair  dipoles,  C . It  was  of  interest  to  extend  this 

P 4 

work  to  another  solvent,  namely  DEC.  It  has  been  shown  that  although  DEC  is 
practically  isodielectric  with  DMC,  Li+  salts  form  ion-pair  dimers  or  quadrupoles 
to  a much  larger  extent  in  DEC.  To  this  end  we  have  studied  LiC  10^(0.  1M)  and 
LiSCN(0.  1M)  in  DEC  by  dielectric  relaxation  methods  to  ascertain  whether  the  struc- 
tural information  inferred  by  Raman  and  IR  spectra  is  reflected  by  the  molecular 
dynamics  of  the  systems. 

B.  Experimental  Part 

The  equipment  and  procedure  have  been  described  elsewhere.  ^ THF  (Fisher) 
was  distilled  under  nitrogen  over  liquid  potassium  with  only  the  middle  portion  col- 
lected. DEC  (Aldrich)  was  distilled  under  reduced  pressure.  The  LiNO^  (Fisher) 
was  dried  in  an  oven  at  101°C  to  constant  weight.  The  NaPicrate  (Kodak)  and  Bu^NNO^ 
(Kodak)  were  dried  under  - 0.  1 torr  at  30°  for  one  week.  The  LiC  10  . and  LiSCN  were 

3 4 

purified  as  described  previously. 

C.  Results 

Figures  1 and  2 show  the  real  part  of  the  complex  permittivity  e'  and  the  imaginary 
part  of  the  complex  permittivity  e"  plotted  versus  frequency  f.  Both  figures  include  a 
Cole-Cole  plot  of  €*-  vs.  where  e*  represents  the  (dielectric)  loss  due  to  ionic 
conduction.  The  data  are  for  the  systems  LiN0^(0.  1M),  NaPicrate  (0.05M)  and 
Bu^NN0j(0.  1M)  in  THF  at  25°C.  The  filled  in  points  at  f=  137  GHz  are  literature 
data  for  pure  THF  (Reference  5.) 

In  both  figures  the  solid  lines  represent  the  functions  e/  and  e"  vs,  f as  calculated 
from  the  sums  of  two  Debye  functions: 

' <VC.1>(f/fR|>/[l*(t/fRi)i  * <c-,-W<'/fR2>/|>((/fR2>2]  • o> 


In  the  above  e is  the  static  permittivity,  e . is  the  permittivity  at  (f»  fR  ),e  - is 
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Fig.  2.  Real  part  of  the  complex  permittivity  e'  and  imaginary  part  of  the  com- 
plex permittivity  e”-££  plotted  vs.  the  frequency  f(GHz)  for  BU4NNO3O.  1M 
in  THF  at  250°C.  Cole-Cole  plot  of  e"-e£  vs.  c'  for  the  same  system. 

The  filled  points  of  f=137  GHz  refer  to  literature  data  for  pure  THF  (Ref.  4). 

the  high  frequency  permittivity  (f»  fj^),  an<*  ^Rj  ar>c*  *R^  are  t*1e  relaxati°n  frequencies. 

The  parameters  £u>  Ew  j « E 1 > ^R  1 anc*  ^R^  are  S^ven  En  Table  I,  together  with  the  fitting 

procedure . 

is  calculated  from  the  expression  = 1 . 8x10  X/f  where  the  conductivity  of  the 
solutions,  X,  was  measured  separately  at  25.00°C.  The  conductivity  apparatus  consisted 
of  a General  Radio  impedance  comparator  (equipped  with  an  impedance  reference  arm), 
a Kraus  conductance  cell,  and  a precision  thermostat  as  described  earlier.’’  The  values 
of  X are  also  given  in  Table  I. 


■*  * 
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g \ 

TABLE  I.  'Calculated  relaxation  parameters  eQ,  Cooj,  £002. 

fj^j  and  according  to  equation  I and  II  for 

LiN030.  1M,  NaPicrate  0.  05M  and  Bu4NN030.  IM 

in  THF  at  25°C.  X is  the  solution  conductivity  at 
audiofrequencies  (1000  Hz). 


Solvent 

Electrolyte 

€ 

O 

Eool 

e<*>2 

fRl 

(GHf) 

'R2 

(Ghf) 

X 

fl  ^ cm  1 

THF 

LiN03  0.  1M 

8.0 

7.2 

2.4 

2.5 

60 

6.83xl0-6 

THF 

NaPicrate  0.  05M 

8.4 

7.3 

2.4 

1.0 

55 

1.33xl0‘5 

THF 

Bu  .NN0  0.  1M 

4 3 

9.05 

7.4 

2.4 

0.  75 

55 

2.05xl0"4 

e 

e 

f„ 

O 

00 

R 

DEC 

— 

— 

2.84 

2.31 

— 

16 

— 

€ 

O 

Eool 

eoo2 

s 

fR 

R2 

DEC 

LiClO.  0.  1M 

4 

3.75 

3.0 

2.35 

0.9 

14 

— 

LiSCN  0.  1M 

3.35 

2.90 

2.35 

0.8 

12 

— 

'00  > *R  have  been  obtained  by  programming 


a)  The  parameters  e„  , e— 

°1  “l  ~2  ~1  “,v2 

equations  1 and  2 with  variable  values  of  the  above  parameter  as  to  have  a 

minimum  in  the  expression  (S  | e(^a3c  ~ eexp I + ^1  Ccalc~  Gexp^‘  This  procedure 
was  also  described  previously  (Reference  3). 


In  addition,  the  parameters  e , and  f^  used  to  fit  the  data  for  pure  DEC 

via  one  term  Debye  relaxation  functions,  (Eqs.  (1)  and  (4))  are  presented  in  Table  I, 


e/=eoo+(Go-eco)/C1  + (f/fR)^ 

(3) 

€'=(eo-eJ<£/fR)/t1+<£/fR)2] 

(4) 

The  fitting  of  the  DEC  solvent  by  a single  relaxation  function  is  adequate  up  to  about 
35  GHz  with  positive  deivations  appearing  at  high  frequency  due  to  an  absorption  proces 
in  the  submillimeter  near  infrared  region  of  wavelengths.  Details  of  this  aspect  will 


SOUD  STATE  AND  MATERIALS 


217 


be  described  in  a later  paper.  For  the  time  being,  our  interest  being  centered  on 
solute  relaxation,  we  will  ignore  the  systematic  deviations  of  e"  at  50  - 70  GHz. 

Figure  3 shows  the  quantities  e ' and  e*  versus  the  frequency  f and  t"  versus  e ' 
for  LiClO  j (0.  1M)  and  LiSCN(O.lM)  in  DEC  at  25°C.  The  solid  lines  correspond  to 
values  calculated  from  Eqs.  (1)  and  (2)  where  e"  has  been  neglected.  The  associated 
parameters  are  reported  in  Table  I. 


D.  Discussion 


5,7 


Literature  ’ reports  assigns  a single  Debye  type  relaxation  process  to  pure 
THF  with  parameters  eQ  = 7. 36,  = 2.  30,  a id  iR  = 62.  9 GHz . From  Figs.  1 and  2, 

and  Table  I it  is  evident  that  the  high  frequency  relaxation  process  in  the  THF  solutions 
can  be  associated  with  the  solvent  molecules.  The  second  relaxation  process  occurring 
at  lower  frequencies  in  the  solutions  is  specific,  both  in  terms  of  the  change  in  dielec- 
tric strength  eo'e00j  and  relaxation  frequency  for  the  electrolyte  considered. 

In  the  following  we  will  relate  apparent  dipolar  radii  obtained  from  the  relaxation 
parameters  and  from  the  relaxation  time,  t,  [where  T = (2TTfR)~  ],  to  each 

other  and  to  radii  calculated  from  crystallographic  data.  These  relationships  should 
clarify  the  nature  of  the  molecular  process  associated  with  the  dielectric  relaxation 
attributed  to  the  solute. 


First,  we  present  equations  which  will  permit  the  calculation  of  a^ , the  apparent 
distance  between  the  centers  of  unit  positive  and  negative  charge  for  the  ion-pair  dipole 

where  a = |u/e,  M is  the  apparent  dipole  moment  of  the  ion-pair  and  e is  the  electron 

^ 3 

charge.  The  apparent  dipole  moment  n is  calculable  from  the  BSttcher  expression 


4nLCpx  10‘3  3 eQu2 
(1  -Of)*l2€  + 1)  3kT 


(5) 


where  Cp  is  the  concentration  of  the  ion-pair  dipoles,  L is  Avogadro’s  number,  f is 

the  reaction  field  factor,  and  Cl  the  polarizability.  To  use  Eq,  (5)  we  need  to  calculate 

CP  which  can  be  done  from  values  of  the  equilibrium  constants  for  ion-pair  formation 

and  triple-ion  formation,  Kp  and  K_,  respectivelv.  The  values  Kp  and  K_  are  in  the 
9 1 T 

literature  and  are  presented  in  Table  II. 

The  equilibria  involved  are: 


A-f  B"  = 

— ) 

AB 

Kp 

AB  + A+  — 

7 

a2b 

K^. 

AB' 

Kt 

7 
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TABLE  II.  Values  of  ion  pairs  and  triple  ions  formation 

constants  Kp  and  K-p  in  the  THF  taken  from  the 
literature;  concentration  of  ion  pairs  Cp  = ( 1 - 0 -3  o^.)C; 
charge  to  charge  separation  au  calculated  according 
to  Debye;  sum  of  the  crystallographic  radii  (r+  4 r_) 
taken  from  the  literature  and  from  molscular  models. 


Electrolyte 

C 

K 

p 

kt 

Ref. 

Cp 

au108 

aTxl08 

(r++r_)108 

Ref. 

in  THF 

M 

M-l 

M-l 

M 

cm 

Li  C 1 3 . 

4 

0.  05 

4. 84x10 ' 

153 

a) 

0. 0492 

3.  11 

4.  18 

2.  25 

d)e) 

NaCIO . 

•4 

9.  1 

9. 93xl07 

150 

b) 

0. 0985 

2.  63 

3.  78 

2.  62 

e) 

NaC 1 D 4 

4 

0.  05 

9. 93x1 07 

150 

b) 

0. 0495 

3.  02 

3.  93 

2.  62 

e) 

LiN03 

0.  10 

590x1 3 7 

132 

-) 

0. 0993 

2.  00 

3.53 

1.82 

0 

NaPicrate 

0.  05 

— 

... 

- 

(0.5) 

4.  36 

4.  7 J 

4.  5 

8) 

Bu^NNO  3 

0.  10 

0.  7c 107 

155 

c) 

0. 0933 

4.  10 

5.  22 

6.  16 

h) 
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-5  + 

d)  0.  oO  A is  the  radius  of  the  Li  ion. 

o + o 

e)  0.97  A is  the  radius  of  the  Na  ion;  1.65  A is  the  Cl-0  distance  in  the 

perchlorate  ion. 
o 

f)  1.22  A is  the  N-0  distance  in  the  nitrate  ion. 

o 

g)  3.  5 A is  the  calculated  radius  of  the  picrate  ion  from  molecular  models 
(distance  from  the  oxygen  to  the  center  of  the  benzene  ring). 

+ ° 

h)  N =4.94  A,  from  R.  A,  Robinson  and  R.  H.  Stokes,  "Electrolyte 
Solutions,"  Butterworths,  2nd  Ed.,  page  125  (1959). 

Now  Kp  = (1  - o)/ 0 C where  o is  the  degree  of  dissociation  of  the  pairs  and  C is  the 
stoichiometric  concentration  of  the  electrolyte.  Similarly,  ^ 

[ab2]  [a2b]  otc 

T [AB]  [A]  [AB][B]  [(l-0-3ox)C](oC) 

where  a,p  is  the  fraction  of  ion-pairs  which  form  triple  ions  according  to  the  equilibria 
for  triple  ion  formation  shown  above.  Oviously  then,  Cp  = [AB  ] = (l-o-3o,p)C. 
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The  Op's  were  calculated  for  all  the  THF  solutions  except  the  Na-picrate  solution 
where  no  data  for  Kp  and  are  avalable.  In  the  latter  case  Cp  was  assumed  equal 
to  C,  the  total  solute  concentration.  These  values  are  presented  in  Table  II. 

2 

The  (1  -af)  factor  in  Eq.  (5)  was  neglected  which  implies  a zero  polariability 
or  a rigid  sphere  model  for  the  ions  in  the  ion-pair  dipole.  Thus  in  the  rigid  sphere 
model  for  the  constituent  ions  is  equal  to  a = H , the  charge  to  charge  separation 
distance.  The  values  of  the  p's  and  u^'s  are  reported  in  Table  II. 

As  an  alternate  method  for  the  calculation  of  an  apparent  radius  for  the  relaxing 
dipole  we  use  Debye's  equation 


, 4TTaT 

1 =~kf“ 


(6) 


where  a^  is  the  radius  of  a spherical  dipole  immersed  in  a continuum  of  viscosity  T| 
and  t'  is  the  microscopic  relaxation  time.  This  can  be  related  to  the  experimental 
relaxation  time  t by  the  Powless  Glarum  expression^  t' = [(2eQ+ ]t*  The 
values  of  aT  calculated  from  Eq.  (6)  are  given  in  Table  II. 

Figure  4 shows  the  values  of  a^  versus  a^.  The  solid  straight  line  is  that  calculated 
by  a linear  regression  analysis  with  a correlation  coefficient  r2  = 0.89.  We  would  not 


£ 

o 
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O 

X 

I* 

o 


Correlation  between  Debye  relaxation 

Correlation  between  Debye  relaxation 

radii  and  radii  calculated  from 
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electrolytes  in  THF  at  25°C. 
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CO 

o 

A K 

4 K» 
o 
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Fig.  4.  Correlation  between  ax»  the  dipole  radius  calculated  from 
the  relaxation  time  and  ay  the  charge  to  charge  distance 
calculated  from  the  apparent  dipole  moment.  Correlation 
between  aT  and  the  sum  of  the  crystallographic  radii  {r  +r  ). 
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expect  a slope  of  one  in  this  plot  in  view  of  the  crudeness  of  the  models  assumed  and 
the  fact  that  a^  is  derived  from  a property  related  to  the  distribution  of  charge  in  the 
on-pair  while  a^  is  derived  from  a property  related  to  the  rotation  of  the  ion-pair  in 
the  solution.  The  ion-pair  may  be  solvated  to  some  extent  and  this  would  modify  the 
size  of  the  rotating  species.  In  spite  of  all  this  a correlation  as  shown  in  Fig.  4 does 
emerge. 

Further  evidence  that  the  solute  relaxation  process  is  due  to  the  rotational  relaxa- 
tion of  ion-pair  dipoles  is  given  by  the  linear  relationship  between  the  a^'s  and  the 
sums  of  the  crystallographic  radii  (r+  + r ) also  shown  in  Figure  4.  The  solid  straight 
line  was  calculated  :rom  a linear  regression  analysis  with  a correlation  coefficient  of 
r2  = 0.91. 


Attempts  to  rationalize  the  observed  correlation  between  a^  and  the  sum  of  the 
crystallograpliic  radii  in  terms  of  a libratio.o  of  an  ionic  lattice  in  the  alternating  elec- 
tric field  of  the  microwaves,  while  not  impossible,  is  not  likely  to  be  justified.  Ex- 
planations of  this  sort  would  be  more  reasonable  at  higher  concentrations  that  those 
studied,  where  the  pair  to  pair  distances  would  become  the  same  as  the  interionic 
distances  in  a single  pair.  At  the  maximum  concentration  studied  here,  C = 0.1M, 

(neglecting  other  species)  the  volume  associated  with  a single  pair.  V = l(CxlO'^L)  = 

-20  343  ° 

1.66x10  “ cm  . Taking  V = — TT r this  volume  corresponds  to  ^ 1 6 A where  2r  repre- 
sents the  minimum  distance  between  the  centers  of  a given  pair  and  some  adjacent  pair. 
The  value  of  the  calculated  2r  is  between  7 and  16  times  the  values  of  a^  quoted  in 
Table  II. 


In  a previous  paper  we  reported  a roughly  linear  correlation  between  the  quantity 
and  Cp  3€Q/2eo+  1 for  LiBr,  LiSCN,  and  LiClOj  in  DMC  in  accord  with  Equation 
(5).  In  that  correlation  the  contribution  to  £ -e  , due  to  rotational  relaxation  of  any 

o ooi  ‘ 

ion-pair  dimers  (quadrupoles)  was  neglected  because  of  the  small  d.pole  moments  of 
these  dimers  for  LiBr  and  because  of  their  low  concentration  for  LiClO^  (Reference  4). 

For  the  present  systems  of  LiSCN  and  LiClO.  in  DEC,  the  association  to 

quadrupoles  should  oe  more  extensive  and  the  dipole  moments  of  these  quadrupoles 

4 

should  be  relatively  large,  as  indicated  by  the  values  of  6^.  The  o^'s  represent  the 
contribution  per  unit  concentrations  of  the  quadrupoles  to  the  increase  of  the  permittivity 
of  the  solution  with  respect  to  the  solvent,  Ae.  Then  it  would  appear  that  the  approxi- 
mation of  neglecting  the  contribution  of  the  quadrupoles  in  the  quantity  e -C^,  is  no 
longer  tenable  for  the  present  systems. 


Still  it  is  not  possible  to  detect  the  presence  of  more  than  one  single  Debye  relaxa- 
tion process,  within  experimental  error,  for  0.  1M  LiSCN  and  LiClO^  in  DEC.  It  is 
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not  conceivable  that  the  two  species  ion-pair  dipoles  and  quadrupole-dimer s have  the 
same  relaxation  time.  On  the  other  hand,  the  presence  of  the  latter  species  has  been 
detected  by  IR  spectra  (-2040  cm-1  band).13 

Since  the  parameter  e ^ is  comparable  for  the  electrolyte  solution  and  the  solven 

DEC  (and  the  size  of  the  quadrupoles  is  larger  than  the  one  of  the  dipoles),  one  might 

suspect  that  the  rotational  relaxation  of  these  species  occur  at  lower  frequency  than 

our  lowest  experimental  frequency.  That  this  suspicion  is  unfounded  is  easily  provable 

by  recalculating  the  static  E^'s  (at  f-2  MHz)  from  Chabanel's  work1  and  comparing  it 

with  our  calculated  e 's. 

o 

4 

Menard  and  Chabanel  defined  the  dielectric  effect  As 


As  = 6 C -l  6 C 
? P q P 


(V) 


where  AS  - e - S with  € the  permittivity  of  the  solvent  and  C And  C are  the  concen- 
u b s p q 

tration  of  the  pairs  and  quadrupoles  respectively.  6^  and  5^  are  the  dielectric  incre- 
ments due  to  the  presence  of  unit  concentrations  of  the  two  species. 

The  calculation  of  As  is  possible  from  the  tabulated1  6^,  6^  and  (Ref.  4), 
where  refers  to  the  equilibrium: 


2 [AB  ] !>2B2] 


1 - 0 


o /2 

q 


(8) 


K = 

q 


o / 2 
q 


C(l-Oq)‘ 


and 


V(1-°q)C’ 


u 

-f 


Table  III  reports  the  results  of  this  calculation  for  LiSCN  and  LiClO.  in  DEC  as  well 

4 

as  for  LiBr,  LiSCN  in  DMC  (the  values  for  K and  6 and  6 for  LiClO,  in  DMC  were 

4 q P q 4 

not  reported.  It  is  evident  that  the  two  sets  of  £ o's , the  static  and  :he  extrapolated 
ones  are  within  experimental  error. 

The  absence  of  a dielectric  relaxation  spectrum  for  the  quadrupoles  can  be 
rationalized  by  the  possibility  that  dipole  moment  is  negligible  or  zero  at  variance 
with  calculated  values.1  Recently,  Chabanel  et  al.  11  have  obtained  the  IR  and  Raman 
spectra  of  LiSCN  in  various  ethers.  They  invariably  detect  a lowering  of  the  force 
constant  in  the  C-N  bond  of  the  3CN  ion  when  the  ion-pairs  dimerize  to  form 
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TABLE  III.  Calculation  of  Ae  = e0-es=6p  Cp+6„  Cq  for 
LiSCN  0.  1M,  LiC104  0.  1M  in  DEC  and  for 
LiBr  0.  1M  and  LiSCN  0.09M  in  DMC.  es 
is  the  solvent  permittivity.  Comparison 

between  e.  at  2MHz  from  Ae  and  e from 
o o 

the  microwave  data 


Solvent 

Electrolyte 

c 

p 

6 

P 

C 

P 

0 

q 

Ae 

e 

o 

extrap^ 

o 

DEC 

0.  1M  LiSCN 

0.  017 

14.3 

0. 0414 

8.0 

o.  5 n 

3.  42 

3.35 

DEC 

0.  1M  Li C 1 0 

4 

0.021 

14.7 

0.0395 

15.0 

0.902 

3.74 

3.75 

DEC 

0.  1M  LiBr 

0.  021 

8.7 

0. 0395 

1.8 

0.254 

3.  34 

3.45 

DEC 

0.  09M  LiSCN 

0.037 

14.  6 

0. 0260 

10.8 

0.830 

3.92 

3.95 

(a)  e = A 
x ' o 

e+e  , e (DEC) 
s s 

= 2.84; 

es(DMC) 

= 3.09 

N = C = S 


(b)  Data  from  Table  1 and  Reference  3. 

quadrupoles.  In  fact,  the  "free"  SCN  antisymmetric  stretching  frequency  at  2060  cm  * 

is  lowered  to  2040  - 2030  cm  . Similarly,  the  IR  active  C-S  stretching  frequency  is 

increased  from  735  cm  * in  the  "free”  ion  (which  occurs  in  solvents  such  as  dimethvl- 

formamide)  to  792  cm  * on  quadrupole  formation  in  dimethoxymethane.  In  contrast, 

the  formation  of  the  simple  ion-pair  LiSCN  results  in  increases  in  both  the  C-N  and 

C-S  stretching  frequencies  with  respect  to  the  free  SCN-  ion.  The  decrease  of  the 

force  constant  in  the  C-N  bond  and  the  simultaneous  increase  in  that  of  the  C-S  bond 
1 4 

had  been  interpreted  as  being  due  to  the  formation  of  a centrosymmetric  quadrupole 
^-Li-. 

S = C = N.  'N=C  = S 

' ' Li 

By  symmetry  the  dipole  moment  of  this  species  would  be  zero. 

Unfortunately,  the  carbonate  solvents  absorb  in  the  same  region  as  the  C-S 
stretch  in  the  thiocyanate  ion  which  precludes  the  study  of  the  thiocyanate-carbonate 
system  in  this  region.  However,  for  the  C-N  stretching  frequency  IR  data*^  indicate 
the  presence  of  quadrupoles  in  carbonate  solvents,  the  band  occurring  at  2040  cm"'. 
(Raman  spectra  have  confirmed  this  band  although  at  2052  cm  * (Reference  3)).  Addi- 
tion of  a high  permittivity  solvent  such  as  dimethylformamide  to  the  LiSCN-ethylacetate 

- 1 13 

system  also  causes  this  IR  band  at  2040  cm  to  disappear. 

If  the  ion-pair  dimers  or  quadrupoles  that  are  formed  in  the  carbonate  solvents 
are  centrosymmetric  and  thus  have  a zero  d.pole  moment  as  is  indicated  n the  case 


224 


SOLID  STATE  AND  MATERIALS 


j 


!f 


of  the  ether  solvents,  then  the  dielectric  data  for  the  LiSCN  and  LiClO^  in  DEC  would 
find  a complete  rationalization  for  their  showing  a single  relaxation  process.  This  then 

4 

is  to  be  attributed  to  the  ion-pair  dipoles  as  in  the  previous  case  for  the  DMC  solvent. 

According  to  the  report  of  one  of  the  referees',  Chabanel  et  al  have  recently 
arrived  at  the  same  conclusion.  The  apparent  inconsistency  between  IR(pq  = 0)  Eq. 

(14)  and  dielectric  (ftq^O)  measurements  Eq.  (4)  was  puzzling.  A quantitative  IR 
study  of  LiSCN  solutions  in  DEC  and  in  other  solvents  showed  that  the  activity  coef- 
ficients of  ion  pairs  cannot  be  neglected  in  the  0.01  -0.  1M  concentration  range.  Thus 
the  ideal  solution  model,  even  if  it  seems  to  work  well,  cannot  give  correct  6q  values. 
The  main  reason  is  that  these  values  arise  from  an  extragolation  to  infinite  concentra- 
tion. It  is  also  conclused  that  ftp  values  are  reliable  and  comparison  between  IR  and 
dielectric  measurements  show  that  6q  is  close  to  zero  for  LiSCN.  Consequently , 
quadrupoles  must  be  nonpolar  in  agreement  with  the  result  of  the  present  paper. 

D.  Saar,  J.  Brauner,  H.  Farber,  S.  Petrucci 
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CLASSICAL  WAVE- PARTICLE  DUALITY 
N.  Marcuvitz 

A.  Introduction 

A detailed  or  microscopic  analysis  of  wave  propagation  in  dispersive  nonlinear 
and/or  turbulent  systems  is  possible  only  in  special  cases  and  even  then  with  difficulty. 
However,  if  waves  are  localizable  in  space-time  as  either  coherent  quasi -periodic 
wavepackets  or  as  weakly  correlated  stochastic  waves,  then  many  average  or  macro- 
scopic features  of  such  wave  systems  can  be  analyzed  by  relatively  simple  analytical 
procedures.  1 These  "averaging"  procedures  for  the  analysis  of  wave  dynamics  often 
admit  simple  physical  interpretations  in  terms  of  the  dynamics  of  "particles"  of  dif- 
ferent types.  Such  interpretations  are  indicative  of  a wave  - "particle " duality  that  is 
well  known  in  quantum  dynamics  and  is  not  completely  unfamiliar  in  classical  wave 
dynamics  where,  at  least  at  linear  level,  it  is  often  expressed  in  "ray-optic" 
language.  ’ ’ In  quantum  mechanics  one  views  real  particle  motion  statistically  in 
terms  of  a probabilistic  wave  dynamics  that  on  the  average  reproduces  the  classical 
dynamics  of  the  particles.  Conversely,  in  appropriate  space-time  regions  one  can 
view  the  average  position,  wavenumber  spread,  etc.,  of  classical  wavepackets  in 
terms  of  the  dynamics  of  corresponding  "quasi -particle " systems.  "Particle"  systems 
may  be  described  in  terms  of  variables  that  are  either  kinetic  (velocity,  space  and 
time  dependent),  fluid-dynamic  (space  and  time  dependent),  or  macroparticle  (time- 
dependent)  with  increasing  roughness  of  description.  These  different  descriptions  are 
related,  of  course,  and  are  derivable  from  the  kinetic  by  taking  either  velocity,  or 
velocity  and  spatial  moments  of  the  kinetic  distribution  function.  The  macroparticle 
description  provides  a particularly  simple  means  for  illustrating  particle -wave  duality 
and  will  be  employed  both  to  review  a quantum  statistical  interpretation  of  real  particle 
motion  in  terms  of  probabilistic  waves,  and  conversely  to  describe  a localized  classical 
wave  field  in  terms  of  the  dynamics  of  a macro-quasiparticle. 

B.  Particle-Wave  Duality 

At  quantum  levels  vacuum  fluctuation  effects  must  be  considered  and  hence  the 
classical  motion  of  a real  particle  may  be  viewed  as  a statistical  average  of  a micro- 
scopic dynamics.  More  specifically,  if  a real  particle  of  unit  mass,  with  position  q 
and  momentum  p at  time  t is  described  by  a time  dependent  energy  hamiltonian 


2 

H(p,  q)  = + V(q) 

then  its  classical  motion,  defined  by  the  trajectory  equations, 

dcj.  dH  d£  = _ dH  dH 

dt  dp  ’ dt  dq  ’ dt  ’ 


U) 


(2) 


vrj 
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is  to  be  viewed  in  a statistical  sense.  Thus,  one  defines  a probability  density  |ij/(q,  t)  | 2 
of  finding  the  particle  at  position  q at  time  t,  with  the  complex  wave  function  satisfying 
the  probability  normalization  requirement 

+ oo 

1 = J + (q.  tH(q.  t)dq,  or1  i = 

- 00 

and  with  the  average  particle  position  at  time  t given  by 

+ oo 

r # 

q(t)  = J ^ (q,t)q^(q,  t)dq,  or  q = (4j,Q4<)  . (4) 

- 00 


The  right  hand  expressions  are  conventional  hermitean  inner  product  representations 
of  those  on  the  left;  Q is  a time  independent  "position  operator”  with  representative  q 
in  the  4;  basis.  Similarly,  one  defines  a probability  density  | <^>(p , t)  | 2 for  the  particle 
to  have  momentum  p at  time  t,  with  the  complex  wave  function  4>(p,  t)  also  satisfying  a 
probability  normalization  requirement 
+ °c 

C * 

1 = J <f>  (P.t)<t>(p,t)dp  , or  1 = {<(>,<(,}  , (5) 

- 00 


and  hence  with  the  average  particle  momentum  at  time  t given  by 
+ °° 

P(l)  = J <t>  (P.  t)p<Mp.  t)dp,  or  p=  {<j>,P,4>}  , 

- 00 

with  P a "momentum  operator"  whose  representative  in  the  basis  is  p.  The  bracketed 
expressions  on  the  right  of  Eqs.  (5)  and  (6)  are  evidently  hermitean  inner  product  repre- 
sentations of  those  on  the  left,  but  in  p space  rather  than  the  q space  products  in  Equa- 
tions (3)  and  (4). 

From  Eqs.  (3)  and  (5),  using  the  Fischer -Riesz  theorem,5  one  infers  the  existence 
in  q space  of  an  orthonormal  basis  ^(q.t)  with  respect  to  which  the  wave  function  <t>(p,t) 
is  the  transform  of  4<(q,t),  viz: 


+ 00 

Mq.t  )=/  <HP.  t)^  (q,t)dp 

- 00  ” 

with 


(6a) 


+ 00 


6(P-P ')  = / ip(q.t)ip/(q,t)dq,  or  &(p-p')  = (jj^,  jjw) 


(6b) 
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representative  of  the  orthonormality  properties  of  the  ^p(q,t).  One  observes  that  Eqs. 
(6a)  and  (6b)  imply  the  normalization  requirements 

(4b  4*)  = £<t>,  <t» } = 1 . (6c) 

Since  4>(p,  t)  is  the  transform  of  4<(q,t),  then  p4>(p,  t)  must  be  the  transform  of  some 
q-space  wavefunction  that  can  be  identified  as: 

+ °o 

P'Wq,  t)  = j f*j>(p,  t)  (q.t)dp  . (7) 

- 00 

This  identification  in  terms  of  the  momentum  operator  P follows  from  the  relation, 
deducible  from  Eqs.  (6)  and  (7), 

(4b  P 40  = {<)>,  P<(>}  = p(t)  . (g) 

where  the  as  yet  unknown  representative  of  the  operator  P in  the  4>  basis  is  to  be  dis- 
tinguished from  its  known  representative  p in  the  <{>  basis.  Incidentally,  one  notes  from 
Eq.  (7)  that  the  are  q-space  eigenfunctions  of  the  operator  P with  eigenvalues  p. 

Equations  (4)  and  (8)  not  only  identify  the  classical  position  q(t)  and  momentum 
p(t)  variables  as  probabilistic  averages  of  the  + wave  field,  but  also  lend  significance 
to  the  usual  quantum  mechanic  postulational  association  of  time  independent  operators 
and  classical  variables.  A further  observation,  following  from  the  reality  of  the 
classical  variables  is  that  these  operators  are  hermitean,  i.e., 

q(t)  = (4<,  Q 4»)  = (Q4b  4<) 

p(t)  = (4»,  P4<)  = (Pi|>,  41)  . ( 9a.) 

More  generally,  one  associates  the  energy  operator  H(P,  Q)  with  the  average  classical 
hamiltonian,  via 

H(P.  q)  = (4<.  H(P,  Q)  41)  h (4;,  H4>)  = (H4),  4j)  . (9b) 

A defining  equation  for  4>  may  now  be  determined  by  imposition  of  the  requirement  that 
the  classical  equations  (2)  are  "average"  equations,  whence  from  Eq.  (9b) 

ri(p> q)  = 0 =•  ($ ' H*)+  (*• H It1  ) <10> 

0=  (It1’  H+)+(H+'  It1)  ' 
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Equation  (10)  then  implies  that  (i  = imaginary  unit) 

H+  - if 

is  a possible  (but  not  unique)  defining  equation  for  4j.  In  quantum  mechanics  the  propor- 
tionality constant  is  the  Planck  constant  tr,  and  hence 

H(P,  Q)  4>  = tri  , (11) 

becomes  the  Schrodinger  equation  defining  the  wave  function  ij/(q,t).  However,  since 
the  operator  P is  as  yet  undefined  in  the  i|>  basis,  we  observe  from  the  interpretation 
of  Eqs.  (2)  and  (1)  as  averages  that 

f = pci 

whence  from  Eqs.  (9a)  and  (11) 

(lti*Q^)+  (^>Q|?i)=  P+> 

or 

(+.  + . 

Using  the  operator  expression  for  the  hamiltonian,  following  from  Eq,  (1),  one  has 
f|P2Q-QPZ)=P  , 
which  is  satisfied  if 

PQ  -QP  = - 

l 

and  hence  in  the  basis  one  infers  the  desired  P representation: 

P -»  p , and  Q -*  q . (12) 

Thus  the  Schr8dinger  equation  (11)  takes  the  more  explicit  form 

“(F*-«0*-F3- 

or  for  H defined  as  in  Eq.  (1), 

r nz  S2  . v.  .1  . IT  M, 
l'TTT  + V(^J+  T ft 


(13a) 


(13b) 
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defines  the  wavefunction  ^(q.t)  explicitly. 

From  the  above  derivation  of  Eq.  (13)  for  4<>  it  is  manifest  that  the  wave  field  i|j 
has  been  defined  so  that  the  averages  Eq.  (9a)  reproduce  the  real  classical  motion  de- 
fined by  Eqs.  (2).  Since  the  4<  field  contains  more  information  than  the  classical  equa- 
tions, one  can  deduce  from  other  (quantum)  properties  of  the  particle  motion  such  as 
indeterminacy  relations  for  the  mean  square  fluctuations  of  particle  position  and  mo- 
mentum, etc.  Also,  for  the  subsequent  discussion,  it  is  of  interest  to  note  that  if  one 
introduces  in  the  basis  time  independent  operators  Q,  P via: 

= («l».  6 «M  . +)  . 

then  from  Eqs.  (11),  et  seq.  , one  infers  dynamical  (operator)  equations  of  motion 
Q = [H,  Q] 

P = jf  [H.  P]  , (14) 

where  [H,  Q]  is  the  commutator  HQ  -QH,  etc.  Equations  (14)  are  of  interest  not  only 
in  the  time  dependent  basis  + (Schrodinger)  but  also  in  the  time  independent  Heisenberg 
basis  wherein  P and  Q are  time  dependent  operators. 

G.  Wave -Particle  Duality 

The  average  properties  of  a localized  single-mode  classical  wavefield  can  be 
described  in  terms  of  the  dynamics  of  a "macro  quasi -particle " in  a manner  converse 
to  that  in  Section  B.  By  a macro  quasi-particle  is  meant  a localized  space-time  wave 
structure  whose  dynamics  is  characterized  by  position,  "momentum,"  and  "energy" 
variables  similar  to,  but  distinct  from,  those  descriptive  of  a real  particle.  From  a 
mathematical  viewpoint  if  the  wavefield  is  given,  the  macro  quasi-particle  description 
is  unique,  in  contrast  to  the  non  uniqueness  of  probabalistic  wave  descriptions  of  a 
given  real  particle  dynamics. 

In  appropriate  ranges  an  overall  wavefield  may  be  decomposed  into  local  mode 
equations  of  say  the  form 

1 It^  = H (l  "Sx  ' x'  I ^ I 20  ) ^ = ( 1 5) 

where  = 4/(x,t)  is  a complex  wave  function  and  H is  in  general  an  x,t  dependent  non- 
linear hermitean  operator  (in  x-space)  indicative  of  the  "energy"  or  frequency  of  the 
mode  in  question.  Such  decomposition  into  uncoupled  equations  of  the  type  Eq.  (15)  is 
not  always  possible,  in  which  case  a multi-mode  wavepacket  description  is  required. 
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For  nonlinear  wavefields  the  decomposition  procedure  is,  in  general,  nontrivial  but  its 
consideration  will  not  be  pursued  further  at  this  point. 

For  H an  hermitean  operator  in  x-space,  local  mode  equations  of  the  type  Eq.  (15) 
imply  that  the  wave  function  i|i  possesses  the  property 

+)  = 0 (16) 

where,  as  in  Section  B,  the  hermitean  inner  product  (4>,  41)  is  normalized  to  unity,  with 
-*  0 as  | x | cc.  A time  dependent  average  property  u(t)  of  the  wave  function  is  ob- 
tained from  a classical  hermitean  operator  U via: 

u(t)  = (i|<,  U+)  . (17) 

If  the  operator  U is  both  x and  t dependent,  the  time  derivation  of  u is  given  by 

aru(t)  = (I?  * u+; + (+•  ul? ) + (+•  frO 


whence  in  view  of  Equations  (1)  and  (3) 


aruW  = (+• 


HU  -UH 

~i7T- 


♦) 


= i[HTu]  + 


W 


(18) 


For  prescribed  H,  Eq.  (18)  leads  to  "equations  of  motion"  for  the  average  properties 
of  a localized  wavepacket  whose  exact  space-time  evolution  is  described  by  Equation 

(15). 


For  example,  let  i|j(x,  t)  be  described  by  the  linear  wave  equation 

■>2 


9x 


+ V(x))4«  = H4* 


(19) 


1 c> 


with  the  initial  condition  i|<  = 4*  (x > 0)  • Let  us  define  k = Y Sx  aS  3 wave  rnornenturn 
operator  and 


x(t)  = (t|»,  xvjj)  and  k (t)  = (+,  i -J^-ijj) 


(20) 


as  the  average  position  and  average  "momentum"  of  a macro  quasi-particle  descriptive 
of  the  wavepacket  i|j(x,t).  The  identification  of  k(t)  as  the  average  momentum  of  the 
wavepacket  follows  from  writing  i}'  = a(x,  t)  exp [i<j)(x,  t)  ],  whence  k(t)  = (a,  a)  since 
a -*  0 as  |x|  -»  o°.  The  trajectory  equations  and  other  dynamical  properties  of  the 
macro-"particle"  can  be  inferred  from  the  following  commutator  relations  between 
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the  k operator  and  an  arbitrary  function  w = w(x)  of  the  operator  x,  viz: 
i[k,  w]  = w'(x) 


i[k  , w]=  kw*(x)  + w'(x)  k 


i[k4,  kw  + wk]  = k£w/(x)  + w*(x)k2  + 2kw7(x)k 


i[kx  + xk,  w]=  2xw'(x)  . (2 Id) 

w'(x)  is  the  x derivative  of  w(x).  Thus  from  Eqs.  (21b)  and  (21a)  with  w = x and  w=  V(x), 

k2 

respectively,  one  obtains  from  Eq.  (18),  with  H = -y  + V(x),  the  trajectory  equations: 


subject  to  initial  conditions  x(0),  k(6)  derivable  from  iMx,  0)  and  Equation  (20),  Similarly, 
for  the  higher  moment  averages: 


x(t)  = (+.  x2+),  k(t)2  = (4<,  k2v)i)  , kw  + wk  = (4>,  (kw  + wk)iW  , 

one  infers  from  Eqs,  (18)  and  (21)  the  defining  equations 


rz  tf 

-gp  = kx  + xk  -y  = kV*(x)  + V^xjk 


y (kx  + xk)  = 2k^  - 2 xV7(x) 

For  potential  functions  V(x)  of  higher  than  the  second  degree,  Eqs.  (22)  and  (23)  are 
not  closed  in  that  the  first  order  moments  x(t),  k(t)  are  dependent  on  higher  moments 
*<t?,  k(t)2,  etc. 

1 2 2 

As  a simple  example,  let  V(x)  = j a x , whence  Eqs.  (22)  become 

dx  r-  dk  2—  , _ 

ar  ■ * ar = _a  x <24a) 

whose  solution  is 

x (t)  = x (0)  cos  at  + sin  at 


k(t)  = k(0)  cos  at  - ax  (0)  sin  at 
Correspondingly,  Eqs.  (23)  take  the  form: 


r 
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dx 

~5T 


= kx  + xk 


dk“  2 dx 

"dT  = _a  ~dt 


2 T 


(25a) 


(kx  + xk)  = 2k  - 2a  x 

~1  2 7 

which  imply  that  k + a x is  a constant  of  the  motion  and  lead,  for  instance,  to 


777?  1 (~~T  k^  , f\ : — r\  sin  2at  , 1 (~~2  , k^ 

<(t)  = 2 lx  - 72  )n  cos  2at  + (kx  + xk;0  -Za—  + 2 lx  + ^7  ;0 


72 


a 0 


(25b) 


T T 2 

kx  + xk  = ( kx  + xk)  q cos2at  - (x  -k  /a  ) ^ a sin  2at 


where  the  zero  subscript  denotes  the  value  at  t = 0.  The  time-dependent  "variances" 

~2  Z — Z ~2  ’Z  — z 

x = x -x  , k = k -k  , etc.  are  useful  measures  of  the  evolving  shape  of  the  macro- 
particle  and  can  be  readily  evaluated  from  Equations  (24)  and  (25) 

A non-closed  example  is  provided  by  the  nonlinear  space-time  dependent  potential 
2 

V(x,  t)  = g|i(j|  descriptive  of  the  nonlinear  Schroedinger  equation,  which  is  characterized 

2 2 

by  a time  dependent  hermitean  operator  H = k / 2 + • Equations  (22)  still  apply 

and  become 


— = k 
dt  K 


§ = -eJ>l2|r]+12d*  = o • 


(26) 


whence 

x(t)  = x(0)  + k(0)t  k(t)  = k(0)  . 


Equation  (26)  implies  that  a wavepacket  having  an  initially  symmetric  amplitude  and 
phase  with  x(0)  = k(0)  = 0 will  continue  to  maintain  this  symmetry  as  time  evolves. 
The  equations  (23)  for  the  higher  moments  lead  to  a nonclosed  description  of  the 

; a 


mac 


and 


roparticle  dynamics  in  that  they  involve  such  moments  as  (let  4<  = ae'^) 
J 1 4*  | ^ dx  = J a^  dx 


whose  evolution  cannot  be  readily  ascertained  unless  one  solves  the  defining  equation 
for  in  detail.  Specifically,  Eqs.  (23)  take  the  form 
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dx 

~sr 


kx  + xk 


dT 


(kx  + xk)  = 2k  - 0 J a dx 


(27) 


which  are  obviously  not  closed.  From  the  defining  Eq.  (19)  for  + one  infers  that 

4 


zrJ*4d*=J' 


5a 


dx 


(28a) 


5a’ 

■sr 


dx 


-*/[(' 


,2  i±)z 


da 


2,2 


. .2  4 - 
Ida  1 


i?J  + ela  "S3r ) - 4~sr  a 


S2a 


5x 


"jdx 


Equation  (28a)  permits  one  to  infer  from  Eq.  (27)  that 


(28b) 


(29) 


Equations  (26)  and  (29)  yield  interesting  but  limited  information  about  the  evolution  of 

the  initial  wavepacket.  Further  information  is  dependent  either  on  the  determination 

c 4 

of  the  time  dependence  of  J a dx,  a non-closed  problem  as  noted  in  Eqs.  (28),  or  on 
equivalent  kinetic  information  discussed  elsewhere. 
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WAVEPACKETS  AS  QUASIPARTICLE  SYSTEMS 
N.  Marcuvitz 

In  nonlinear  and/or  turbulent  wave  propagation  it  is  sometimes  useful  to  view  a 
wavepacket  as  a system  of  point  quasiparticles,  each  described  by  its  position  x(t), 
momentum  k(t),  and  energy  u>(k,  x,t).  The  individual  quasiparticle  motion,  which  is 
determined  by  the  functional  form  of  u>  and  differs  with  each  mode  type,  provides  a 
physical  picture  of  how  a wavepacket  evolves  in  space-time.  A kinetic  description  of 
the  overall  quasiparticle  system  at  time  t is  given  in  terms  of  a distribution  function 
F(k,x,t)  in  k,  x space.  A fluid  description  is  obtained  from  the  k moments  of  F(k,  x,t) 
and  a macro-quasiparticle  description  is  recovered  from  the  k and  x moments  of 
F(k,  x,  t). 

Consider  a single  mode  type  of  wave  process  described  locally  by  a complex  wave 
function  cp  (x,  t)  satisfying 

if*  = H4>  (1) 

with 

h*h4  £•*•'> 

subject  to  an  initial  condition  4>(x,  0).  To  view  the  time  evolution  of  41  in  terms  of  a 
system  of  quasiparticles,  it  is  necessary  to  identify  the  quasiparticle  descriptors  as 
well  as  the  kinetic  distribution  function  F from  the  above  defining  equation  for  41.  For 
this  purpose  one  introduces  the  double  Fourier  representation. 

*1*2  = exp^(klXr  k2X2)]dkidk2/(2TT)2  (2) 

where  <}>.  = <t>(k.,t)  and  4>?  = <j>(k,,t)  are,  respectively,  the  Fourier  amplitudes  of  i|>.  = 
l l c.  c ^ 1 

4i(Xj,t)  and  4*2  = 4'(x2>t)-  The  product  j 4*2  displays  both  a fast  and  slow  spatial  be- 
havior and  it  is  desirable  to  average  over  the  very  fast  behavior.  If  the  4*(x,  t)  and 
hence  the  4>(k,  t)  are  stochastic,  the  averaging  is  explicit  with  Eq.  (2)  being  expressed 
in  terms  of  ensemble  averages  as 

<*t*2>=  JJ  <<J>14>2>  exptUkjXj- k2x2)]  dk1dk2/(2n)2  (2a) 

and  the  average  symbol  < > should  be  retained  throughout  the  following.  However,  if 

the  4>  and  <j>  are  deterministic,  this  symbol  should  be  understood  as  an  average  over  a 
suitably  fast  spatial  period  or  phase  and  will  be  omitted.  After  the  averaging,  "slow" 

(x  or  K)  and  "fast"  (£  or  k)  spatial  or  wavenumber  variables  are  defined  by: 
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x=(x1+x,)/2  K=kj-k2  (3) 

and 

4 = Xj-  x2  k = (kt+  k2)/2 

whence  since 

k i x j " k2x2  = + K*  and  dkidk2  = , 

Eq.  (2)  can  be  rewritten  as 

4^+2  = J F(k,x,t)  exp(ik4 ) dk/2n  (3a) 

where 

F(k,  x,  t)  = J <j>j  4>2  exp(iKx)  dK/2TT  (3b) 

The  reality  property  F(k,x,t)  = F (k,  x,t)  of  the  fast  (correlation)  spectrum  of 
follows  readily  from  Eq.  (3);  however,  positive  definiteness  is  not  in  general  assured. 
Fourier  transformations  inverse  to  Eq.  (3)  are  manifestly 

F(k,  x,  t)  = J 4'1+2  exp(ik£  ) d|  (4a) 

4>j  (J>2  = J F(k,  x,  t)  exp(-iKx)  dx  (4b) 

For  4=0  and  K=  0,  Eqs.  (3a)  and  (4b)  yield 
1 4<(x,  t)  | 2 = J F(k,x,t)  dk/2TT 

|<j,(k,t)|2  = j F(k,  x,  t)  dx 
whence  one  infers  that 

J\  +(x,  t)  | 2 dx  = J | <)>(k,  t)  | 2 dk/2rt  = J*F(k,  x,  t)  dk  dx/2TT  (5) 

Even  though  only  reality  but  not  positive  definiteness  of  F(k,  x,t)  is  assured,  Eq. 

(5)  suggests  the  identification  of  F(k,x,t)  as  a number  density  of  quasiparticles  in 

k,x  phase  space.  The  consequent  interpretation  of  |k|j|  as  a quasiparticle  number 

£ 

density  in  real  space  and  of  1 4>  | as  a number  density  in  momentum  space  are  in 
conformity  with  a similar  concept  in  quantum  mechanics  although  in  the  latter  instance 

z z 

because  of  a different  normalization  |i(jj  and  |<j>|  are  given  a probabilistic  interpre- 
tation. 
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The  defining  equation  for  F(k,  x,  t)  can  be  deduced  from  Eq.  (1)  if  one  infers 


therefrom 


‘y<h+2,>  = <HrH2>w'ih!,> 


where 


Hi  = H<f  H2  ~ h^~  T 


Bx„  ’ 2 


, x..,t) 


Noting  the  definitions  Eq.  (3)  and  forming  the  inverse  transform  Eq.  (4a)  of  Eq.  (6), 


one  obtains 

i_5r  = /exp(-iki)  [h(4--|t  + JT  |x  ’ x + I ’ " H(f  ^ ' Ti  ^ ’ X ' 1 ' J + d ? * 

After  commuting  the  operator  H so  that  it  acts  only  on  exp(-ik|  ) with  a consequent 
change  in  sign  of  one  finds  as  the  defining  "kinetic"  equation  for  F 

* If = [H(k  + ir  • x ' w Ik  > l)  ■ H(k  " In  4?  ■ x + 2T  Ik’  tJF  (7) 

Assuming  a weak  dependence  of  F on  k and  x,  one  expands  H in  first  order  to  obtain  as 
the  "quasiparticle  approximation"  to  (7): 

1I+^SF-|^|^  = 0 (8) 

St  + Bk  3x  Sx  Bk 

where  to  = H(k,  x,  t).  The  characteristic  equations  of  the  partial  differential  Eq.  (8), 
namely : 

dx  _ Bu  dk  _ Bco  im 

dt  Sic  ’ dt  Bx 

identify  the  trajectories  of  quasiparticles  of  position  x(t),  momentum  k(t)  and  energy 
oj(k,  x,t).  On  a quasiparti cle  trajectory,  one  observes  in  the  quasiparticle  approxima- 
tion that  for  the  mode  type  Eq.  (1): 

af  = 0 “«*  It  = lr  (10) 

The  implied  constancy  of  F on  a quasiparticle  trajectory  starting  at  x(0)  = xQ  and  k(0) 

= kp  permits  the  solution  of  Eq.  (8),  evolving  from  a given  initial  state  F(k,x,0),  to 
be  written  conventionally  as 

F(k,  x,  t)  = F(kn(k,  x,  t),  xn(k,x,t),0)  (11) 


(ID 
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where  k0(k,  x,t),  x0(k,x,t)  define  the  quasiparticle  trajectory  starting  at  kQ,  xQ,  at 
time  0 and  ending  at  k,  x at  time  t. 

"Fluid  dynamic"  properties  of  the  quasiparticle  system  can  be  inferred  from  the 
k-moments  of  the  kinetic  distribution  function  F.  Such  properties  are  representative 
of  macroscopic  characteristics  of  wavepacket  solutions  of  Fq.  (1)  in  regions  wherein 
wavepackets  form. 

If  a typical  wavepacket  solution  is  represented  as 

+ (x,t)  = a(x,  t)  exp(i6(x,  t))  (12) 

then  as  noted  in  Eq.  (5)  the  local  amplitude  a follows  from  F(k,  x,  t)  via: 

a2 *(x,t)  = j F(k,  x,  t)  dk/2rr  (13) 

2 

One  observes  that  the  amplitude  squared  a (x,  t)  of  the  wavepacket,  or  the  zeroth  moment 
of  F,  is  indicative  of  the  number  density  of  quasiparticles.  The  local  phase  8(x,  t)  of 
the  wavepacket  can  be  determined  from  the  local  wavenumber  and  frequency, 

k(x,  t)  = ||  and  v(x,t)=-|£  , (14a) 

•A 

by  noting  that 

0(x,t)  = 9(x0,0)+  J (ledx  - vdt)  (14b) 

where  the  integration  in  x,  t space  is  over  an  appropriate  path  extending  from  xQ1  0 to 
x,t.  To  find  the  local  wavenumber  and  frequency  one  first  observes,  in  view  of  (14a), 
that 

k = ^ "lx  ^ ^ "Sc  ^/Zia2 


v=  i(+  -§r  + - + -§r  + )/2a 

whence,  using  Eqs.  (3a)  and  (1),  one  derives  the  relations 

1^2  rsl^l  _+l  iiq  *2\  =Q  = I /kF(k.x.t)  dk/2TT  = k(x,t)a‘ 


2 (4j2H1  4'r^lH2+2)  " /["(k,x,t)  * S’  + 

9 = U dk  ox 


• "]  F dk/2TT  = v(x,  t)  a2 
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whuh  identify  the  local  wavenumber  k and  frequency  v as  averages  or  moments  of  the 
quasiparticle  distribution  function  F.  Thus,  via  Eqs.  (3),  (8),  (13),  and  (14),  the  space 
time  dependent  local  characteristics  of  a propagating  wavepacket  can  be  ascertained 
from  a kinetic  description  starting  from  initial  values  F(k,x,0)  and  0(x,  0),  which  are 
derivable  from  4j(x,  0). 

An  alternative  "fluid  dynamic"  picture  of  a propagating  wavepacket  can  be  in- 
terred from  moments  of  the  quasiparticle  kinetic  Equation  (8).  This  "fluid"  description 
is  closely  related  to  the  conservation  equations  obtained  by  Whitham  from  his  averaged 
variational  formalism  for  propagating  wavepackets.  One  defines  a local  velocity  v"  and 
force  f for  the  quasiparticle  system  representing  the  wavepacket  via  the  F moments: 

a2v(x,t)  = J (k,  x,  t)  F(k,  x,  t)  dk/2TT  (16) 

2 

a2T(x,  t)  = J —■  (k,  x,  t)  F(k,  x,  t)  dk/2TT 
3k  ^ 0X 

Then  from  the  quasiparticle  kinetic  Eq.  (8),  written  in  the  form: 

3F  3 ,3cj  3 ,3w  „ 

^ + 3^  ( 3k  F)  ' 3k  ( 3^  F)  = ° ’ 

one  obtains,  by  forming  in  a conventional  manner  zeroth  and  velocity  moments,  the 
"fluid"  equations: 


3a  3 . 2-.  . 

-5T  + ^ (a  v)  = 0 


3 , 2^.  . 3 . 27.  2 — 

(a  v)  + -5^  (a  v ) - a f = 0 


(17a) 


‘5FVii  v>  T‘§xv<1  v 1 ~ * I=u  (17b) 

Using  Eq.  (17b),  one  can  rewrite  F.q.  (17b)  in  the  more  conventional  "fluid"  form 


ir  + v + ~ ■§?  = f 

a 

where,  as  in  (16),  one  defines 
a2  v^  = j(|^)2  F dk/2Tt 


(17b) 


(18a) 


and  introduces  a local  quasiparticle  "pressure"  p(x,t)  via 

27  2-2  - 

a v = a v + p 


(18b) 
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Equations  (17)  constitute  two  equations  for  four  unknowns  and  hence  additional  informa- 
tion on  p and  f is  required  for  closure.  Determination  of  the  requisite  information  is 
perhaps  best  clarified  by  a particular  example. 


For  the  case  where  the  quasiparticle  "energy"  is  specified  by  the  dispersion  re- 


lation 


u(k,  x,  t)  = -j-  + V (x,  t)  , 
corresponding  to  an  Eq.  (1)  with 


H 


L2 

dx 


{ + v (x,  t) 


(19a) 


(19b) 


Eqs.  (16)  imply  that 


and  T = - 


dV 


(20a) 


To  determine  the  local  "pressure"  p one  first  observes  that 


dx 


1=0 


whence  evaluating  the  right  hand  side,  inserting  Eq.  (12)  into  the  left  side  and  equating, 
one  finds 


-^2  _ d2a  _1_  32  a2 

3 5x2  4a2  &x2 


- - 2 ~I 

hence,  since  v = k and  v = k , from  (18b) 


- 1 52  2 32 

a - a — *-  a 


(20b) 


and  therefore 


3x 


(20c) 


i a 


a 

■ST 


(20d) 


On  use  of  the  identities  Eqs.  (20),  (17a)  and  (17b),  become  a closed  set  of  "fluid"  equa- 

2 

tions  for  the  local  quasiparticle  density  a and  velocity  k,  viz: 


*T"  + ^T(a  ^ = 0 


(21a) 
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3k  + 17  a (±.  &2a  > - 2V 

3t  3x  3x  \2a  ^ 2 / ’ 9x 


(21b) 


In  the  context  of  the  Whitham  formalism  for  the  case  Eq.  (19),  it  is  of  interest  to  note 
from  Eqs . (19a),  (15b),  and  (20b)  that  the  local  frequency  v takes  the  form 


(22) 


and  identifies  the  averaged  dispersion  relation  in  the  Whitham  formalism.  The  relation 


3k  3v 
■ST  3x 


= 0 


implied  by  Eq.  (14a)  and  termed  the  "wave  conservation  equation"  by  Whitham  then  be- 
comes with  the  aid  of  Eq.  (22)  the  "fluid  momentum"  Equation  (21b). 

Numerical  comparisons  of  the  quasiparticle  approximations  with  exact  wavepacket 
solutions  will  appear  in  a subsequent  report. 
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RENORMALIZATION  OF  MAXWELL'S  EQUATIONS  FOR  TURBULENT  PLASMA 
S.  Barone  and  N.  Marcuvitz 

The  statistical  theory  of  a classically  describable  plasma  poses  a coupled  field 
problem  governed  by  Maxwell's  equations 


d^B  + VxE  = 0 


VxH  = , 


and  the  Klimontovich  equations 


V-  B = 0 


V • D = p + p 


ext  ’ 


[dt+  v .V+  ^(E  + vxB)  • Vv]fa=^( 


one  for  each  species  of  charged  particle  (a  =1,  2,  ...)•  For  any  particular  member 
of  the  statistical  ensemble  the  current  (J)  and  charge  ( p)  densities  that  couple  Maxwell's 
equations  to  the  Klimontovich  equations  are  given  by  velocity  integrals  of  the  particle 
distribution  functions  for  that  member  of  the  ensemble 

J(r,  l)  = LJ  J qa-  ’ 

a 

(3) 

p(r,  ‘I  : j J (d^)  qa  ^ 

a 

The  microscopic  content  of  the  Klimontovich  equations  is  that  the  stochastic  fields  E, 

B determine  the  particle  orbits  according  to  the  usual  Lorentz  force  law.  It  is  con- 
venient to  include  external  current  and  charge  densities  Jgxt  , Pext  , the  space-time 
dependence  of  which  is  prescribed,  i.e.,  unaffected  by  the  behavior  of  either  the  field 
vectors  or  the  particle  distribution  functions.  The  external  sources  are,  for  example, 
used  to  generate  externally  imposed  fields  which  are  assumed  to  be  unaffected  by  the 
motion  of  the  field-plasma  system  and,  also,  to  radiate  the  initial  field  distribution 
(at  say,  t=  0)  from  a quiescent  situation  for  t < 0.  The  external  sources  T|  serve  the 
same  purposes  for  the  particle  systems. 

The  Maxwell-Klimontovich  equations  characterize  an  attempt  to  relate  the  macro- 
scopic behavior  of  a field-plasma  system  to  the  behavior  of  single  particles.  If  (as  an 
approximation)  it  is  assumed  that  the  particles  in  any  particular  member  of  the  ensemble 
move  as  if  only  the  ensemble  averaged  field  acted  on  them,  the  solution  is  secular. 

1 ' 1 ^ 

' For  any  quantity,  X defined  on  the  ensemble  we  use  X £ <X>  to  denote  its  average 
over  the  ensemble  and  to  denote  the  difference  between  X for  any  particular 
member  of  the  ensemble  and  the  average,  i.e.,  X s ft-X. 
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Thus,  the  effect  of  field  fluctuations  g,  g on  the  particle  orbits  is  crucial  in  the  develop- 
ment of  the  general  theory.  The  analogy  between  the  present  situation  and  quantum 
electrodynamics  is  apparent.  It  is  now  widely  believed  that  the  macroscopic  physical 
content  of  the  Maxwell-Klimontovich  equations  will  be  more  clearly  revealed  if  these 
equations  are  renormalized  in  a manner  similar  to  that  which  has  been  successful  in 
electrodynamics.  The  expectation  is  that  the  major  effect  of  the  field  fluctuations  on 
the  particle  orbits  can  be  identified  and  handled  in  a general  way  leaving  only  genuinely 
small  residual  effects  to  be  treated  by  perturbation  theoretic  procedures. 

In  this  work  it  is  shown  how  Maxwell's  equations  can  be  completely  renormalized 
without  approximation  or  expansion.  It  is  anticipated  that  this  formulation  of  the  theory 
will  be  useful  when  parametric  and  other  nonlinear  effects  occur  within  the  spectrum 
of  the  fluctuating  field.  Secondly,  we  emphasize  the  extent  to  which  renormalization 
procedures  are  arbitrary  and  the  circumstances  under  which  our  procedure  makes 
optimum  use  of  the  available  information.  Thirdly,  we  derive  a renormalized  equation 
for  the  two  point  field  correlation  function.  This  equation  is  the  analog  for  the  electro- 
magnetic field  of  the  Bethe-Salpeter  equation.  It  leads  to  kinetic  equations  for  inter- 
acting quasi-particles.  Finally,  we  discuss  a Klimontovich  equation  for  interacting 
quasi -particle s . This  equation  is  appropriate  for  the  discussion  of  the  collective  modes 
of  oscillation  of  quasi  particles  and  their  description  in  terms  of  new  types  of  quasi 
particles . 

Following  Weinstock^  we  write  the  fluctuating  part  of  the  distribution  function  in 
the  form 

f = Ga(B  + V f)  (4) 

where  is  an  appropriately  defined  Green's  function  and  V is  the  fluctuating  part  of 
the  operator  in  Equation  (2).  The  kinetic  equation  for  f becomes 

[lo- <vgav>]  f = Ti  + <vga^>  , (5) 

where  Lq  is  the  ensemble  average  of  the  operator  in  Equation  (2).  This  equation  re- 
lates the  ensemble  averaged  distribution  function,  f,  to  the  source  T).  The  relation- 
ship between  f and  r|  depends  on  the  fluctuating  fields,  V. 

The  renormalization  of  Maxwell's  equations  is  straightforward  once  the  fluctuating 
part  of  the  current  density,  J,  is  decomposed  according  to  Equation  (4).  For  simplicity 
of  presentation  we  neglect  the  magnetic  field  fluctuations  in  V and  specialize  to  a single 
component  plasma  (q  ->  -e).  Maxwell's  equations  may  be  written 


i 
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YM-  5 = -lext.  + J(di)evf(v)  , 
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(6) 


where  is  the  field  operator  discussed  in  Ref.  (2)  and  the  velocity  coordinate  has 
been  made  explicit.  For  the  ensemble  averaged  field  we  have 

YM*5  = -Jext.  +/<dv>evf(v)  . (7) 

which  is  to  be  solved  self  consistently  with  Eq.  (5)  for  the  average  distribution  function, 
f.  These  two  equations  do  not  form  a closed  system  because  the  equation  for  f involves 
the  field  fluctuations,  E. 

Using  Eq.  (4)  the  fluctuating  part  of  the  field  is  seen  to  satisfy 

YM-|  = -Lxt.  • <»> 

where  we  have  shortened  the  notation  so  that  velocity  sums  are  indicated  by  a simple 
integral  sign.  Since  V = (e/m)  E . Vv  the  above  equation  may  be  written 


r 2 - 

Y.  . - — fvG.V  f | . E = - J + [evG.fi 
LMm-'-AvJ  - -ext.  J - A 1 


(Y.  . + P + P.)  . E = -(J  + j'  . ) 

' M A'  '-ext.  -ext. 


where 


J ' = f (-e) v G 

-ext.  J ' - i 


and 


■P.  si.  fvG.V  f + P, 
A m J - A v 


(9) 


(10) 


(ID 


(12) 


where  P is  an  arbitrary  operator  that  does  not  vary  over  the  ensemble.  According  to 

the  present  point  of  view,  renormalization  is  accomplished  by  selecting  P so  that  the 

P^  term  in  the  above  equation  has  as  small  an  effect  as  possible  on  the  evolution  of 

E.  If,  in  fact,  P.  has  a small  effect 
- A 

(-Ym-p).EMJext.  + i;xt.)  • (13) 

which  shows  that  P should  be  selected  in  such  a way  that  the  Green's  function  (-Y^  -P) 
relates  the  fluctuating  part  of  the  field,  E to  the  fluctuating  part  of  the  externally 


-1 
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externally  imposed  current  densities  J 4 and  j'  In  other  words  (-Y..-P)'1  should 
relate  the  deviation  from  the  average  field  in  any  particular  member  of  the  ensemble 
to  the  deviation  from  the  average  of  the  externally  imposed  current  densities.  There 
is  no  one  choice  for  P that  will  accomplish  this  exactly  for  all  members  of  the  ensemble. 
However,  the  relationship  between  E and  (J  + J ' ) will  not  vary  by  a large  amount 

•CXI,  •cxi, 

over  most  of  the  ensemble.  The  overwhelming  majority  of  the  members  of  the  en- 
semble evolve  in  nearly  the  same  way  as  the  ensemble  average.  In  the  absence  of 

further  information  the  optimum  choice  for  P is  such  that  (-Y.  ,-P)"1  is  the  same  as 

M 

the  Green's  function  that  relates  a small  change  in  the  external  current  densitv  6J 

’ -ext. 

to  the  corresponding  small  change  in  the  ensemble  average  6E.  In  particular,  suppose 
that  the  small  quantities  6E  and  are  linearly  related  by: 


6 E = t)'  . 6 J 
- -ext 


(14) 


where 


s an  appropriate  Green's  function. 


We  propose  that  P be  selected  so  that 


(15) 


and  in  analogy  to  the  corresponding  situation  in  electrodynamics  refer  to  P as  the 
polarization  operator.  ^ In  functional  derivative  notation 


6 E 


(16) 


P = 


6 J 

67  ~ 

-ext. 


.'-I 


6 J 
Fe" 


(H) 


A renormalized  differential  equation  characterization  of  the  two-point  field  cor- 
relation function  follows  from 

(YM  + P)  * I = -(V  71>  ’ 08) 

where  3 ^ = P^  E . In  an  obvious  notation 

(YM+  P)1(YM+  P)2 ' <?(1)  ${Z)>  = < [Zt(D  + 3j(l)]  [jt(2)  + 3j<2)]>  , (19) 

so  that  defining  an  interaction  operator,  I by 

< 3j(l)  J,(2)>  =I(1,1;2,2'):  <£(l')  E(2')> 


(20) 
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we  have 

[<YM+  PMYM  + P^2  ” ^12!  ' <f^1)  ?(2)  > 

= < Jt(l)3t(2)>  + <3t(l)  Jj(2)  + Jjd)  Jt(2)  > . (21) 

If  2 j is  related  back  to  the  external  currents  Jt  via  an  operator  B12,  the  above  equation 
for  the  two-point  field  correlation  function  reads 

L(YM+P)1(YM+P)2_I12]:  <|(D  |(2)  >=  Bu:  <Jt(l)  Jt(2)>  (22) 

If  a two-particle  Green's  function  is  defined  by 

LfYM+  P*1  <YM+  P*2  " *12  ] : C12  = B12  ’ <23) 

we  have 

<E(1)  E(2)>  = G(l,  l';  2,2')  : < Jt(l')  Jt(2')  > . (24) 

When  the  effect  of  Ij2  is  small  it  is  useful  to  define  renormalized,  collective  modes 
or  wave  types  by 

(Yj^+  P)  • E^  = 0 , (1=1,2....  (25) 


The  derivation  of  kinetic  and  Klimontovich  equations  for  interacting  quasi -particle  now 
follows  in  a straightforward  way  from  previous  results.3 
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A NEW  APPROACH  TO  SOME  NONLINEAR  IONOSPHERIC  PLASMA  TURBULENCE 
PROBLEMS 

S.  R . Barone 

Plasma  phenomena  in  which  electron  and  ion  inertia  play  a minor  role  are  often 
modeled  by  special  cases  of  the  two  fluid  equations  ^ ^ 


9"  + 

3 t 

V • 

(n  v 
~ e 

) = 0 , 

(1) 

In  + 
at 

V • 

(a  v. 

) = 0 , 

(2) 

ne(E 

+ V 

- e 

x B)  - 

tVP  + nm  v v +nm  v ,(v  - v.)  - nm  g = 0 , 
e e en-e  e ei  ~e  - i e ~ ’ 

(3) 

-ne(E 

+ V. 
- 1 

x B)  - 

t VP  + nm.  v.  v.  + nm.  v.  ( v.  - v ) - nm.  g = 0 
i i in  -l  i ie  - 1 - e i £ ’ 

(4) 

Vx  E 

= 0 

t 

(5) 

in  which  both  electron  and  ion  inertia  have  been  neglected;  the  electron  and  ion  densi- 
ties have  been  set  equal;  the  transverse  part  of  the  electromagnetic  field  is  omitted; 

B is  a uniform,  static  magnetic  field;  v , v.  are  the  electron  and  ion  collision  fre- 
~ en  in 

quencies  with  neutrals,  v . and  v.  are  the  electron  and  ion  collision  frequencies  with 

ei  le 

each  other  (all  assumed  to  be  independent  of  position)  and  g is  the  acceleration  due  to 
gravity.  These  equations  are  written  in  the  frame  of  the  neutrals  and  are  ordinarily 
closed  by  assuming  either  adiabatic  or  isothermal  behavior  for  the  electrons  and  ions. 

The  purpose  of  this  report  is  to  point  out  that  for  two-dimensional  flows  and/or 
turbulence  perpendicular  to  B there  is  a useful  way  to  introduce  scalar  and  vector  po- 
tentials. For  simplicity  of  presentation  we  suppose  that  P = nk„T  , P.  = nk„T,  where 

e B e i B i 

kg  is  Boltzmann's  constant  and  the  electron  and  ion  temperatures,  T , T.  are  constant. 
Set 


m.(v.  + v.  )-m  v . m + m. 

l tn  ie'  e ei  <->  . e i 

n v - V v — V x a + n p 

-e  *■  m_  v__  + m.  v._  - m v +m.v. 


(6) 


e en  i in 


e en  i in 


m (v  + v .)  - m.  v. 


m + m. 

_ r-r . e en  ei  l ie  e l 

~ ni  . (7> 


m v + m.  v. 
e en  i in 


m v + m.  v. 
e en  i in 


where  ■y  is  a scalar  potential  and  the  vector  potential  a has  only  a single  component 
parallel  to  B.  Substituting  into  Eqs.  (1)  to  (4)  yields: 

1)  The  component  of  a in  terms  of  n and  y, 

eB  • a = k0(T  + T. )n  + (m  v + m. v.  )y  , 

~ Be  l ' e en  i mu 


(8) 
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2)  The  electric  field  in  terms  of  n and  X, 


- en  E = 


fm  ( v<  + v . ) - m.  v. 


e en  ei 


1 le 


m v + m. v. 
e en  i in 


k_T 
B e 


m.  (v.  + v.  ) - m v 


i in  le 


m v + m.  v. 
e en  i in 


kBTi 


Vn 


+ [m  (v  + v .)-m.(v.  + v.  )]VX 

1 e en  ei'  i in  le  ' 


(v  v.  + v v.  + v.  v .)/n  . 

+ Cn  m en  — , — -kJT  +T.)VnxeB 

m v + m.  v.  B'  e i' 

e en  i in 


+ 1 + 


v v. 
en  in 


+ v v.  + v.  v A 
_ en  le  in  eii 

Q.  ) 

e i / 


m m.(v  - v.  ) 

\ . e i en  in 

VX  x e B + n g 

m v + m.v.  - 
e en  i in 


m + m. 

e i „ 

+ — — — __  — ngxeB  , 


m v + m. v. 
e en  i in 


(9) 


where  fle  ^ E eB/m^  ^ are  the  electron  and  ion  cyclotron  frequencies,  and 


3)  A connection  between  n and  X viz.  , 


3 n . r-,  r -,2 

aT  +-Vg  • Vn  = -V  X . 


(10) 


where 


m + m. 
e i 


~ g m v + m.  v. 

e en  i in 


g 


(11) 


and  we  have  assumed  that  the  sources  of  the  gravitational  field  are  outside  the  region 
of  interest  so  that  V • g = 0. 


The  final  equation  (5)  can  be  expressed  in  a variety  of  forms.  Taking  the  curl  of 
Eq.  (9),  we  have 


(v  v.  + v v.  + v.  v )/U  fl.  , 

en  in  en  le  in  ei"  e l , . ^,,-,2 

^ +nTiT kB(Te  + Tl)V  n 

e en  1 m 


(■ 


V V.  + V V.  + V.  V A , 

, I | en  in  en  le  in  eilr-,2 

1 si  n.  r K 

e l / 


= V 


x B 


- — - v + (v  - V.  ) 
g2  -g  en  in  m_  + m 


mm.  v x eB) 
e i 2.' 


e i (eB) 


(12) 


Using  Eq.  (9)  to  eliminate  E yields  a second  relation  between  n and  X,  viz., 


v2x*«[v2-(2f)]"  = (v)-Vx  . 


1 


(13) 
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where 


1 kB(Te  + V 

O 12.  m v + m. v. 

j + _ e i e en  l in 

v v.  + v v.  4 v.  v . 
en  in  en  le  in  ei 


V E V +\  ^xV  , 


V + V V.  + V. 

en  ei  in  le 


v v.  + v v.  + v.  v 
en  in  en  le  in  ei 


and  we  have  assumed  that  g has  no  component  parallel  to  B.  According  to  the  present 
point  of  view,  one  solves  the  evolution  equation  (10)  simultaneously  with  the  constraint 
(13)  for  n and  X and  calculates  the  velocity  vectors  and  electric  field  via  Equations  (6) 
to  (9).  Examples  of  the  utility  of  the  technique  will  be  discussed  elsewhere. 
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PRELIMINARY  REPORT  OF  NUMERICAL  SIMULATION  OF  TYPE  II  IRREGULARITIES 
IN  THE  EQUATORIAL  ELECTROJET 

S.  Barone,  N.  Marcuvitz,  R.  Pascone  and  N.  Solimene 

Type  II  irregularities  in  the  ionization  density  in  the  equatorial  E- region  are  be- 
lieved to  result  from  the  mutual  interaction  of  linearly  unstable,  long  wavelength,  hori- 
zontally propagating  (east-west)  waves  and  shorter  wavelength  vertically  propagating 

1 2 

waves  which  become  unstable  in  the  presence  of  the  horizontally  propagating  waves.  ’ 

Numerical  studies  of  the  evolution  towards  the  steady- state  have  been  carried  out  by 

3 4 5 6 

McDonald  ’ et  al.  , Sato  and  Ogawa  and  Ferch  and  Sudan.  These  authors  all  work  with 

with  the  two  fluid  equations 


! + V • <"-V  ■ 0 • 

(1) 

f t v • <„ jrj  = 0 . 

(2) 

\7n 

+ e(E  + v xB)+k_T  + m v v = 0 , 

- ~ e ~o  Ben  e e ~ e 

(3) 

r7n 

- e(E+v.xB  ) + k T + m.  v.  v.  = 0 , 

-i  -o'  Bin  i 1-1 

(4) 

V x E = 0 , 

(5) 

in  which  both  electron  and  ion  inertia  have  been  neglected,  the  electron  and  ion  densities 

have  been  set  equal,  the  transverse  part  of  the  electromagnetic  field  is  omitted,  is 

the  magnetic  field  of  the  earth  and  v . are  the  electron  and  ion  collision  frequencies 

e , \ 

with  neutrals. 

For  two-dimensional  turbulence  perpendicular  to  Bq  this  system  of  equations  can, 
without  approximation,  be  reformulated  as  the  single  equation^ 

(1  + e'+V4,  • Vge+)  = kV2+  - e'4,  v • V+  . (6) 

Ot  o 

where  is  related  to  the  electron  density  by 

n = nQe4'  , (7) 

nQ  is  constant,  g is  the  Green's  function  for  the  two-dimensional  Laplacian,  i.e., 

•V2g(r,  r')  = 6(r- r')  , (8) 

the  differential  operator  V is  a linear  combination  of  the  gradient  and  curl  operations 
B 

V E V - V V x , 

o 


(9) 
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and  the  constants  K,  K,  vq  are  determined  by  the  system  parameters.  For 
sufficiently  weak  turbulence  one  might  expect  a quadratic  approximation  to  equation 
(6)  to  be  sufficmnt.  To  second  orde  r in  4* 

fa 


l9t 


K 'T 


+ 0 * +)  Yq 


V 

J 


(£>  + K 1 ) 


(10) 


where  D — v u v and  1 — 
» b ~o  « 


g v • The  nonlinear  term  on  the  left  has  the  effect  of 
changing  the  parameter  vq  somewhat  as  the  level  of  turbulence  increases.  The 
relative  importance  of  the  nonlinear  K and  D terms  on  the  ri^ht  depends  on  the 
numerical  values  of  the  parameters  and  the  wavelength  under  consideration. 

For  the  present  situation  suppose  that  the  background  electron  density 
increases  exponentially  with  altitude  (z -coo rdinate)  i.  e.  , 


l|)  = + vjj 

L 


(in 


where  L is  the  vertical  scale,  and  that  the  electron  drift  velocity  is  predominantly 
towards  the  west  (y-coordinate).  Then  without  approximation 


2. 

+ (1  - cjd  v 


1 z 
L ~ 


4-' 


(12) 


= V’cV'.  (D  + K 1^1  . cvli' 


where  zq  is  a unit  vector  in  the  (vertical)  z-direction.  The  linear  dispersion  relation 
for  waves  of  the  form 
ik  . r - i cat 


4 of  e 

can  be  read  directly  off  Fq.  (12)  viz. 


(13) 


(14) 


where  k = k + \ k x x°  is  a unit  vector  in  the  direction  of  the  earth's  magnetic  field. 


In  the  E-region  electron-ion  collisions  are  negligible  and 


v v. 
e i 


kR(T  +T.) 
iJ  e i 


U U. 

K = ~ 

v v.  m v + m.  v . 
1 + _ ' e 1 «?  e l i 


(15) 


U S2 
e i 
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(’-6) 


1 + 


V V . 

e i 

12  12. 
e l 


Numerical  values  appropriate  to  the  daytime  equatorial  electrojet  are: 


v = 4 x 1 0 /sec.  , 


v.  = 2.  5 x 10  /sec. 


12  =5x10  /sec. 
e 


12^  = 90/sec. 


C2  5k_(T  + T./m.  = 105m2/sec.  2 

s B e l l 


L = 6000  m 


= 100  m/sec. 


where  is  the  electron  drift  velocity.  Thus, 

K = 7.  3 m2/sec.  , 

\ = 23  . 


and  the  electron  drift  velocity  V . and  the  velocity  parameter  v 

~a  — o,  are  approximately 

related  by 


V : 
~ O 


V-d 


1 + 


V V. 

e l 

12  12. 
e i 


= 82  m/sec. 


(17) 


Under  these  circumstances  the  1/L  contribution  to  the  real  part  of  cj  is  negligible. 


* \ k x x and  Eq.  (14)  becomes 


, Y,2  * -•  -o  . \ 

w=k.v  ■ l |Kk  - 7 r k 

~o  l - L ,2 


Vvaves  traveling  towards  the  west  are  unstable  when 
k4  < k2  k2 

c y 


(18) 


(19i 
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where  k = VXvo  7 L ic  = 2 it  /30  m.  Alternatively,  linear  instability  occurs  when 
k < for  waves  which  travel  at  an  angle  of  elevation  (9)  small  enough  that 


cos  9 > Jk  7k 
y c 

Run  4,  (Figs.  4-l(a.)(b),  6(a)(b),  ll(a)(b)),  is  for  an  initial  distribution 


4>(t  = 0) 


. 50  cos 


2it  y 2 it  z 

100  COS  100 


(20) 


where  y,  z are  measured  in  meters.  According  to  the  linearized  theory  this  wave 
will  grow  exponentially  (exp  yt)  with  y = 0.  10/sec.  Actually  this  initial  distribution 
grows  at  a somewhat  smaller  rate  for  about  0.  3 sec.  at  which  time  significant 
distortion  is  apparent  and  the  maximum  value  of  the  distribution  begins  to  decrease. 

The  maximum  value  of  the  distribution  increases  again  at  0.5  sec,  and  decreases  mono- 
tonically  thereafter,  i.  e.  , v until  t = l.  2 sec.  Throughout  this  time  interval  the  wavenum- 
ber spectrum  spreads  considerably  in  z corresponding  to  the  development  of  sharp  vari- 
ations in  the  z-direction.  There  is  no  evidence  of  any  turbulent  structure  developing. 

Run  9,  (Figs.  9-l(a)(b),  4(a)(b),  U(a)(b),  18(a)(b),  24(a)(b))  is  for  an  initial  distribu- 
tion 


4i(t  = 0) 


. 03  s in 


2n  y 
72 


- 0. 


sin 


2it  z 
72 


where  again  y,  z are  measured  in  meters.  The  linear  growth  rate  for  the  wave 
traveling  in  the  y-direction  is  ) 5 . 26/sec.  while  according  to  the  linear  theory  the 
standing  wave  in  the  z-direction  decays  exponentially  with  ")  E . 056/sec.  (E  18  sec. 
decay  time).  For  the  time  duration  of  this  run  (2.  3 sec.  ) the  relatively  large  wave 
t ra  v Mg  in  the  y-direction  simply  grows  exponentially  with  the  linear  growth  rate 
as  if  the  relatively  small  standing  wave  in  the  z-direction  was  not  present.  The 
relatively  small  wave  however  does  not  evolve  linearly.  In  the  presence  of  the  y 
directed  wave  the  vertical  wave  breaks  up  into  three  (or  more)  waves  with  the  same 
propagation  constants  in  the  z-direction  as  the  original  wave  but  with  propagation 
constants  in  the  y-direction  equal  to  + the  y propagation  constant  of  the  large  wave. 
The  amplitudes  of  these  three  (or  more)  components  oscillate  in  time.  Again,  there 
is  no  evidence  of  turbulent  structures  developing. 

Run  11,  (Figs.  1 1 - 1 (a) (b),  1 3 (a)(b),  2 5 (a)  (b),  36  (a)  (b)  fo  r an  initial 
distribution 

0>(t  = 0)  = .04  cos  +.  03  cos  ~)^q~ 

This  distribution  differs  from  the  previous  one  in  that  the  two  waves  present  are  of 


WAVE- MATTER  INTERACTIONS 


kc-  7 30C00  ke-  23  0000  kv  1 886000+03  kl-  6 — — **+3 

mmx.min-  5 000000-02  -S  000000-02  tl me  - 0 0 


3 SOe-01 


2 500-01 


\ seo-0i 


5 000-02 


-5  000-02 


-50  0 -?5  • 00  25  0 50  0 


Power  Spectrum  time  *00  eec . 

ke-  5 00000e-02  kv-  1 886000+03  kl-  6, 000000+03 


23  9 


18  0 


12  0 


5 98 


0 0 


1 01 


kaubj 


Run  4 (Figs.  4-  l(a)(b)) 
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Run  9 (Figs.  9-4(a)(b)) 
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Run  9 (Figs.  9-  24(a)(b)). 
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Run  11  (Figs.  11-1  (a)(b)) . 
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Run  11  (Figs.  1 1 - 25  (a  )(b)) 
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more  nearly  equal  amplitude.  At  first  the  y-directed  wave  grows  at  a rate  approximate- 
ly double  the  linear  growth  rate,  i.e.  , the  presence  of  the  standing  wave  in  the  z-direc- 
tion  enhances  the  growth  rate  for  the  y-directed  wave.  Clearly,  the  reason  for  this  is 
that  the  vertical  electron  density  gradient  associated  with  the  z wave  exceeds  that  as- 
sociated with  the  vertical  scale,  L = 6000  m.  After  about.  5 sec.  the  amplitude  of  the 
y-directed  wave  dominates  all  others  and  this  amplitude  grows  exponentially  with  the 
linear  growth  rate  for  the  remainder  of  this  run  (1.  75  sec.  ).  In  contrast  to  the  previous 
run  the  original  standing  wave  in  the  z-direction  now  leads  to  a spread  in  the  wavenum- 
ber spectrum  in  both  the  y and  z directions.  The  amplitude  spectrum  eventually  be- 
comes rather  complicated.  However,  because  of  definite  phase  relations  between  the 
various  Fourier  components,  the  electron  density  in  configuration  space  remains  fairly 
smooth  and  non- turbulent. 
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NONLINEAR  THEORY  OF  TYPE  II  IRREGULARITIES  IN  THE  EQUATOR!/  L 
ELECTROJET 

S.  Barone 

Farley  and  Balsley1  and  Sudan  et.  al.  ^ have  proposed  that  type  II  irregularities 
in  the  ionization  density  in  the  equatorial  E-region  are  to  be  pictured  as  a quasi- 
steady state  of  two-diminsional  turbulence  perpendicular  to  the  earth's  magnetic  field. 
In  the  presence  of  the  electrojet  current  a non-tur bulent,  smooth  increase  of  ioniza- 
tion sensity  with  altitude  is  linearly  unstable  for  long  wavelength  horizontally  prop- 
agating waves.  This,  gradient  drift,  instability  gives  rise  to  horizontal  density  gra- 
dients and  vertical  electron  velocities,  a configuration  which  is  unstable  for  shorter 
wavelength,  vertically  propagating  waves.  The  mutual  interaction  of  horizontally  and 
vertically  propagating  waves  leads  to  a turbulant  steady  state  in  which  energy  deliv- 
ered to  long  wavelengths  cascades  nonlinearly  to  shorter  wavelengths  and  eventually 

returns  to  heat.  The  steady  state  has  been  studied  analytically  by  Rognlien,  and 

3 4 5 

Weinstock  and  Sudan  and  Keskinen  and  numerically  by  Ferch  and  Sudan,  McDonald 

6 7 8 

et.  al.  ’ and  Sato  and  Ogawa. 


In  this  work  the  two  fluid  equations  of  motion  describing  type  II  irregularities 
in  the  equatorial  electrojet  are  reformulated  in  terms  of  a scalar  and  vector  potential 
for  the  fluid  momentum  densities.  Elimination  of  the  vector  potential  yields  a single 
equation,  first  order  in  time,  for  the  time  evolution  of  the  ionization  density,  ^ viz. 


C.  _ 4* 

1 + e V 'l'  • V g e 
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) at 
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• V4- 


(1) 


Where  g is  the  Green's  function  for  Poisson's  equation  and 
B 

7 = V + a x V (2) 

The  numerical  values  of  K , \ and  the  constant  vector  v are  discussed  in  the 

~o 

preceding  article. 

Sudan  and  Keskinen"1  have  given  a direct  interaction111  15  analysis  of  an  equa- 
tion formally  the  same  as  the  first  iteration  approximation  to  Equation  (1).  Clearly 
the  iterative  approximation  to  Eq.  (1)  fails  for  > 1 or,  for  ionospheric  para- 

meters, 4*  > 4%.  Since  interesting  levels  of  turbulence  may  exceed  4%  we  have 
carried  out  a direct  interaction  analysis  of  the  full  Equation  (1).  Numerical  re- 
sults are  given  for  the  power  law  wavenumber  spectrum  which  results  from  theoret- 
ical considerations,  14  computer  simulations5’7  and  observations15  of  type  II  irreg- 
ularities. 
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Anticipating  that  the  saturated  level  of  + will  be  small  compared  to  unity  we 
neglect  the  three  exponential  factors  in  Equation  (1).  The  Fourier  - .Laplace  trans- 
form satisfies 
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and  u are  the  small  amplitude  mode  frequencies.  The  direct  interaction  approxima- 
te 10-13 

tion  as  developed  by  Kraichnan  leads  to  a pair  of  coupled  equations  for  the  nonlinear 

frequency  shifts,  and  two  point  correlation  function,  1^  viz., 
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The  angular  average  of  I£_  ^ on  resonance  (w  =00^)  can  be  shown  to  satisfy 
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where  Z and  Z'  are  constants  and  A 


are  known  kernels.  The  solution  is 
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When  the  exponents  in  the  above  equation  are  sufficiently  email  that  they  can  be  neg- 
lected  i.  e.  , for  sufficiently  low  levels  of  turbulence  or  for  k — oo 


I rk  I ~ aVok2lk  <10> 

V n 

where  n is  the  spectral  index.  Except  for  the  numerical  coefficient  this  is  the  re- 
sult of  Reference  (4).  (The  numerical  value  of  the  plasma  dispersion  function  used  in 
Ref.  (4)  is  apparently  incorrect.  ) The  above  result  is  valid  for  sufficiently  low  levels 
of  turbulence  or  wavenumbers  large  enough  that  2ira2  |Z  ' |k2  I^/(n-2)  < 1 or  equiva- 
lently |rR  | < kvQ.  For  decreasing  wavenumbers  or  increasing  levels  of  turbulence 
the  exponentials  in  Eq.  (10)  are  important  and  the  nonlinear  modal  widths,  |rk  | in- 
crease to  the  point  where  they  are  comparable  to  the  resonant  frequencies, 

u>  ~ k v . In  particular,  for  sufficiently  high  levels  of  turbulence  or  for  k — 0 
k — ° 


2 /zT 


Z'  > 0 


It  is  convenient  to  define  the  intensity  dependent  wavenumber,  kQ,  by 


2 na2  |Z'  | k2IR  = (n  = 2)  ^ 


For  any  level  of  turbulence,  modes  with  wavenumbers  much  larger  than  kQ  behave 
according  to  Eq.  (10).  Modes  with  wavenumbers  much  less  than  kQ  behave  accord- 
ing to  Eq.  (11).  These  results  are  displayed  in  Figure  (1).  Notice  that  | Fk  | /k  vq  is 
insensitive  to  both  the  value  of  n (3  < n < 4)  and  k over  the  indicated  range 
(0  < k<  1.5  k ).  For  k larger  than  k the  asymptotic  result  of  Eq.  (10)  (Sudan  and 

4 ° ° 

Keskinen)  is  accurate.  For  k < < k the  present  calculation  indicates  that 

o r 

|rk  |/k  vq  saturates  rather  than  continuing  to  increase  as  k decreases  (presumably 

Z'  > 0 ). 

The  relationship  between  the  wavenumber  kQ  that  divides  the  strong  turbulence 
regime  (k  < kQ  ) from  the  relatively  weak  turbulence  regime  (k  > kQ  ) and  the  over-all 
level  of  turbulence  is  given  by 
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(14) 
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Ordinarily  the  second  term  in  the  above  equation  is  negligible  compared  to  the  first. 
Thus,  for  sufficiently  low  levels  of  turbulence  k < k and  the  asymptotic  result 

of  Eq.  (60)  is  valid  over  the  entire  spectrum  of  relevant  wavenumbers.  For  suffi- 
ciently high  levels  of  turbulence  k exceeds  k and  for  at  least  part  of  the  wave- 

o min. 

number  spectrum  saturates.  The  transition  occurs  for  |6n/nQ  |”=  1 / ^(z'|  a, 

or  for  the  previously  noted  numerical  values  |a|  "=  23),  |6n/nQ|  ~ 4%  . If  |Z'|  is  much 
less  than  unity  th«  above  estimate  must  be  increased  correspondingly.  The  relation- 
ship between  kQ/k  . ar,d  the  level  of  turbulence  is  displayed  in  Figure  (2.  ) 


Fig.  2.  k /k  . as  a function  of  the  root-mean- square  level 
6 o min.  ^ 

of  density  fluctuations  for  Z'  = l,  |a  | = 23  and  various 
values  of  the  spectral  index  n. 
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HIGH  POWER  MICROWAVE  PROPAGATION  THROUGH  THE  ATMOSPHERE 
N.  Marcuvitz  and  N.  Solimene 


This  report  summarizes  the  current  status  of  our  study  of  microwave  breakdown 
and  pulse  propagation  phenomena.  The  model  used  to  investigate  the  dependence  on 
power,  density,  pulse  width,  altitude,  frequency,  etc.,  incorporates  experimental  data 
pertinent  to  the  ionization,  recombination  and  energy  transfer  processes.  The  data  is 
represented  by  empirical  formulas.  Some  uncertainty  is  introduced  by  the  spread  of 
values  from  different  sources,  the  transfer  of  data  relevant  to  one  set  of  conditions  to 
other  situations  and  the  extrapolation  beyond  the  range  of  measurements.  The  model 
also  is  restricted  in  that  plane  wave  propagation  in  the  forward  scattering  approxima- 
tion has  been  used.  Thus,  with  respect  to  propagation,  it  is  only  valid  when  the  reflect 
ed  power  is  negligible,  dispersion  is  weak  and  diffraction  spreading  is  neglected.  With 
in  the  limitations  of  the  model: 

(a)  energy  densities  of  1-50  millijoules/cm  may  be  propagated  to  distances 
of  5,000  feet  with  a loss  of  less  than  one-half  the  pulse  energy: 

8 3 

(b)  electron  densities  exceeding  10  /cm  may  be  created  over  the  same  dis- 
tance (much  larger  densities  over  much  shorter  distances  are  of  course 
also  possible). 

The  above  conclusions  may  be  in  error  by  significant  factors  because  of  the  limitations 
of  the  model.  Thus,  it  is  exceedingly  important  to  devise  experiments  to  confirm  the 
computer  modeling. 

The  model  is  given  by  equations  in  a form  suitable  for  a mixed  boundary- initial 
value  problem: 
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where  t is  a local  time,  i.e.,  time  zero  propagates  with  the  velocity  of  light  (unity  in 
normalized  units).  In  this  formulation  P,  the  normalized  power  density  of  the  radia- 
tion, is  given  at  z = 0 for  all  t,  whi'e  N,  the  electron  density,  and  U,  the  normalized 
average  electron  energy,  are  given  at  t = 0 for  all  z ^ 0.  That  is,  Pis  specified  at  a 
front  surface  (the  antenna)  for  all  time  and  N and  II  are  specified  initially  over  the  en- 
tire half-  space  (in  front  of  the  antenna).  As  before,  Eqs.  (1)  and  (2)  may  be  used  with 
all  coefficients  and  rates  given  as  functions  E/p  and  p.  Alternatively,  when  Eq.  (3)  is 
included,  the  coefficients  must  be  given  as  functions  of  U and  p.  The  empirical  values 
for  these  coefficients  are  given  in  Section  A. 
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Some  typical  results  for  1cm  microwave  radiation  are  presented  in  the  form  of 
computer  generated  plots.  Each  plot  shows  the  power  pulse  shape  as  a function  of 
time  at  the  front  surface  and  at  four  equally  spaced  distances  as  solid  line  curves.  The 
mid  range  and  final  range  in  feet  are  indicated  along  the  z axis.  Zero  time,  located  at 
the  center  of  the  pulse  at  z = 0,  propagates  with  the  velocity  of  light.  Thus  the  abscissa 
in  nanosecond  units  is  the  local  time  at  each  of  the  distances  for  which  results  are  dis- 
played. The  pulse  width  may  be  read  off  the  abscissa.  The  ordinate  is  labeled  in  units 
appropriate  for  the  power.  Since  only  upper  case  characters  can  be  displayed,  the 
power  P(z,  t)  is  designated  POW  while  the  pressure  p is  designated  P on  these  plots. 

The  peak  value  of  POW  relative  to  breakdown  power  is  given  in  the  legend  at  the  top. 

Also  given  is  the  pressure,  P,  in  Torr.  The  dotted  curve,  at  each  distance,  is  the 
electron  average  energy  U relative  to  the  neutral  gas  average  energy.  This  quantity 
has  been  arbitrarily  scaled  for  display  purposes.  The  scale  may  be  determined  by 
referring  to  the  value  of  TMAX  in  the  legend.  This  is  given  in  eV  and  corresponds  to 
the  maximum  value  of  U,  for  the  largest  distance  displayed,  multiplied  by  1/40  eV, 
i.  e.  , the  assumed  neutral  gas  average  energy.  At  the  largest  distance,  the  second 
dotted  curve  represents  the  electron  density  again  arbitrarily  scaled.  The  scale  is 

given  by  the  value  of  NMAX  in  the  legend.  The  units  are  electrons  per  cubic  centimeter. 

2 

The  third  line  in  the  legend  gives  the  peak  power  in  W/cm  as  well  as  the  pulse  energy 
2 

in  j/c  m1".  The  pulse  energy  applies  to  the  pulse  at  the  largest  displayed  distance. 

The  values  for  all  other  parameters  are  discussed. 

Some  results  for  P=100  Torr  are  given  in  Figs.  1 (a) , (b),  (c) . Note  that  the  initial 
pulse  is  approximately  30  ns  wide.  For  POW  = 10,  the  pulse  width  is  reduced  by  per- 
haps one  third  in  5,  000  feet.  For  POW  = 5,  the  pulse  is  not  noticeably  changed.  For 

13/3 

POW  = 12,  the  electron  density  at  the  front  surface  is  1.19x10  /cm  which  exceeds 
the  critical  value  of  1.12x10^  /cm  ^ and  therefore  the  pulse  can  not  propagate  within 

*7 

the  range  of  validity  of  the  model.  Notice  that  for  POW  = 10,  NMAX  is  only  2.  34  x 10  / 
cm^  at  5,000  feet.  At  the  front  surface  the  value  is  3.  40  x lo'^/cm^,  Table  I gives 
further  indication  of  the  distance  dependence.  A comparison  of  the  pulse  energies  at 
0 and  5,000  feet  confirms  the  estimate  made  above  concerning  the  reduction  in  pulse 
width . 

Figures  2(a)  and  2(b)  show  results  for  a 30ns  pulse  at  380  Torr  for  POW  equal  to 
2 and  4 respectively.  Figure  2(c)  indicates  that  at  POW  =5,  the  pulse  cannot  propagate 
within  the  range  of  validity  of  the  model.  When  Fig.  2(b)  is  compared  to  Fig.  1(c),  it 
may  be  noticed  that  at  380  Torr  the  pulse  energy  is  about  five  times  larger  than  at  100 
Torr  for  nearly  similar  attenuation,  i.e.,  similar  electron  densities.  This  is  qualita- 
tively accounted  for  by  the  fact  that  the  breakdown  power  increases  as  the  square  of  the 
pressure  while  the  collisional  frequencies  increase  linearly  with  pressure. 
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Fig.  1.  Propagation  through  atmosphere 
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1 TABLE  I.  Distance  dependence  of  pulse  energy  and  electron  density 

POW  = 10,  P=100  Torr,  Pulsewidth  = 30  ns. 


z(ft) 

N (cm 

max 

Pulse  Energy  (j/cm  ) 

0 

3.  40  x 1010 

6.  744x  10'  3 

0.  1 

3.  30x  1010 

6.  736x  10*  3 

1 

2.65  x 1010 

. 3 

6.67  x 10 

10 

8. 80  x 109 

6.34  x 1 0*  3 

100 

1. 13x  109 

8 

5.71  x 10*  3 
- 3 

1000 

1.17x10 

5.03  x 10 

- 3 

5000 

2. 34x  107 

4.46  x 10 

At  760  Torr  a 30  ns  pulse  with  POW  = 2,  can  not  propagate  within  the  range  of 

validity  of  the  model.  Rather  than  investigate  powers  between  1 and  2,  we  changed  the 

pulse  width  to  10ns.  In  Figs.  3(a),(b),(c),  the  results  for  POW  = 5,  5. 5 and  6 are  shown . 

Note  the  critical  density  is  barely  exceeded  at  POW  = 6.  At  POW  = 5 . 5 the  pulse  is 

8 3 

significantly  shortened  at  5,  000  feet.  The  electron  density  is  1.29x10  /cm  and  the 
pulse  energy  is  4.  81  x 10  ^ j/cm”3.  At  1 foot  the  electron  density  is  2.  32  x lo'  ^ /cm 
and  the  pulse  energy  is  7.02x10  j/cm  . A comparison  of  Figs.  3(a)  and  3(b)  indicates 
that  at  POW  = 5,  the  pulse  is  less  attenuated  and  somewhat  more  energy  is  available  at 
5,000  feet  than  at  POW  = 5.5  albeit  at  somewhat  lower  peak  power. 

At  100  Torr  and  POW  = 30,  a 10ns  pulse  cannot  propagate  within  the  range  of 
validity  of  the  model.  With  POW  = 29,  the  results  are  given  in  Figs.  4(a),(b),(c)  for 
0.  1,  10  and  5,000  feet,  respectively.  To  attain  a distance  of  5,000  feet,  about  half  the 
pulse  must  be  sacrificed  at  this  power  level.  The  electron  density  at  5,  000  is  again  of 
the  order  of  107/cm3.  A comparison  of  Fig.  1(c)  and  4(c)  indicates  that  although  both 
pulses  started  with  roughly  the  same  energy,  about  25'’fo  less  energy  gets  out  to  5,000 
feet  for  the  10ns  pulse  than  for  the  30ns  pulse.  The  electron  density  is  1.5  times 
larger  at  5,000  feet  for  the  10ns  pulse.  However,  at  0.  1 feet  the  electron  density  is 
50  times  larger  (compare  Fig.  4(a)  with  Table  I). 

At  380  Torr,  the  results  for  POW  8,  5 and  2 are  shown  in  Figs.  5(a),  (b)  and  (c). 
For  POW  = 10,  a 10  ns  pulse  cannot  propagate  within  the  range  of  validity  of  the  model 
at  380  Torr.  These  may  be  compared  to  Figures  2(a)  and  (b). 

The  ptots  discussed  above  cover  only  a small  portion  of  the  parameter  range  which 
may  be  investigated.  Further  work  particularly  relating  to  the  evaluation  of  the  validity 
of  empirically  determined  parameters  is  still  required.  The  extension  of  the  model 
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to  include  transverse  spatial  dependence  is  yet  to  be  accomplished.  More  important 
than  any  of  the  above  is  the  need  to  validate  the  main  features  and  approximations  in- 
volved in  the  formulation  of  the  model.  This  may  be  done  by  comparison  with  experi- 
ments designed  to  test  the  conclusions  drawn  from  the  model.  The  predicted  pulse 
shortening  for  sufficiently-high  pulse  powers  provides  a distinctive  feature  to  test  the 
validity  of  the  theory.  Alternatively,  a careful  derivation  based  on  a microscopic  theory 
may  validate  the  formulation  but  not  the  empirical  parameters  used.  Such  a treatment 
must  properly  include  the  effects  associated  with  the  drastic  environment  provided  by 
the  intense  fields.  Such  effects  would  include  relativistic  and  turbulent  modifications 
of  the  ordinary  modes  of  interaction  of  the  field  and  matter. 

A.  Appendix 

A brief  summary  of  the  data  used  in  the  computer  modeling  follows.  A number 
of  choices  were  made  as  to  which  of  several  alternative  values  or  approximations  was 
to  be  used  in  the  computations.  Refinements  have  been  introduced  and  are  continuing 
to  be  introduced. 

The  data  on  ionization  and  attachment  frequencies  are  often  given  in  terms  of  the 
coefficients  and  q respectively  where 


and 


v = v.ri 
a d 'a 

and  vd  is  the  electron  drift  velocity.  The  ionization  coefficient  is  well  represented  by 

X Ae'BP/E 


where  for  A = 15  (cm-  Torr)’1  and  B = 365  V/(cm-  Torr)  for  E/p  in  the  range  100-  800  V/ 
(cm-Torr).  The  data  of  Harrison  and  Ceballe2  for  E/p  in  the  range  25-65  V/(cm-  Torr) 
may  be  fitted  with  A = 3. 12  (cm- Torrf 1 and  B = 201  V/(cm- Torr).  The  latter  has  been 
used  in  our  calculations.  Other  representations  for  v.  are  in  the  literature.  For  ex- 
ample, Mayhan  et  al. , give: 

v.  = f(T,p)(|)5-34 


where  f(T,p^is  a given  function  of  temperature  and  pressure.  At  room  temperature 
f = 8.  35  xlO  . This  representation  has  not  been  used  in  any  of  our  computations,  since 
dependence  on  air  temperature  has  not  been  considered. 


A 
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The  attachment  coefficient  data  is  somewhat  less  certain.  The  data  of  Harrison 

Z 4 

and  Geballe  for  E/p  in  the  range  25-60  V/(cm-Torr)  and  that  of  Chatterton  and  Craggs 

in  the  range  2-  30  V(cm- Torr)  are  in  fair  agreement  over  the  overlapping  region 

25  < E/p  < 30.  For  the  first  calculations,  a rough  compromise  was  struck  by  using 


E,2 


— = a + b ( — ) 
P 'P 


with  a = 5. 68x10* *(cm- Torr)  1 and  b = 3.  05  x 10  ^ (cm- Torr/V)V(cm- Torr).  This  was 
not  entirely  satisfactory  since  when  extrapolated  to  sufficiently  high  values  of  E/p, 

Tj^/p  exceeded  the  corresponding  values  of  t^/p.  For  the  later  calculations  we  allowed 
r)a/p  to  saturate.  This  is  physically  plausible  and  avoids  the  difficulty  mentioned  above. 
In  particular  we  used 


a - b e 


-(E/lSp)* 


with  a = 4.  0x10  3 (cm-  Torr)  *andb  = 3.5xl0  ^ (cm- Torr)  In  any  event,  attachment 
should  not  be  very  important  when  E/p  exceeds  the  breakdown  value  by  more  than  a 
factor  of  two  or  so. 

The  drift  velocity  data  used  is  that  of  Emeleus,  Lunt  and  Meek^  for  E/p  in  the 
range  20-595  V/(cm-Torr).  This  data  is  well  summarized  by: 

Vj  = 0.125x10  7 (^^cm/iec 
where  E/p  is  given  in  v/(cm-Torr).  Since 


mv 


we  may  infer  the  momentum  transfer  collision  frequency 

£ = 1. 41  x 109  (^)1//3  (sec-Torr)'1 
A cruder  fit  to  this  data  is: 


*0 


where  Sq  = 1.  44x  10^  cm/sec  and  a^  = 1. 15  x 10^  cm^- Torr/(sec- V).  For  simplicity,  the 


latter  was  used  in  the  first  calculations,  and  the  momentum  transfer  collision  frequency 

9 -1 

was  taken  to  be  constant  and  equal  to  5x10  (sec-Torr)  . For  the  later  calculations. 


the  more  accurate  fit  was  used  for  vd  and  the  corresponding  momentum  transfer  col- 
lision frequency  was  used.  Other  data  and/or  derived  values  are  in  the  literature.  For 
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example,  Felsenthan  reports 

- = 5.3xl09  + 1. 115  xlO7  ( — ((sec-Torr)’1 
P P ' 

for  E/p  in  V/(cm-Torr).  The  author  does  not  however  consider  this  to  be  a very  ac- 
curate value.  Also  Felsenthal  and  Proud7  use 

Vj  = 7 x 106  + 2 x 10^  ) cm/ sec 

8 9 

which  is  adopted  from  Nielson  and  Bradbury.  Ryzko  reports 
Vj  = 5x  10^  + 256  xlO^(-j^)  cm/sec 

for  the  range  E/p  = 50  to  100  V/(cm-Torr). 

The  recombination  coefficient  was  taken  from  the  data  of  Sayers  as  given  in 
S.  C.  Brown's  book}9  The  value  of  y is  approximately  2.3x10  ^ (cm^/sec)  at  one  at- 
mosphere and  decreases  to  zero  as  the  pressure  is  lowered.  We  therefore  approximated; 

y - 2.3xl0'6  3/. 

p = 76o cm  /(sec- Torr)  . 

In  order  to  convert  the  empirical  expressions  given  above  to  functions  of  the 
electron  average  energy  it  is  necessary  to  relate  E/p  to  U.  This  can  be  done  for  steady- 
state  conditions  below  breakdown.  It  is  then  necessary  to  extrapolate  the  result  to 
transient  conditions  well  above  breakdown.  When  this  is  done  using  the  data  of  Huxley 
and  Zazou11  we  obtain 

U = 18.  7 ( — )2/3 
P 

for  E/p  < 25  V/(cm-  Torr).  The  date  of  Crompton  et  al.  , yields12 

U = 15  ( — )1/2 
P 

for  E/p  < 20  V/cm- Torr).  Each  of  these  expressions  represents  the  experimental  date 
to  within  20%.  We  elected  to  use  the  latter  expression. 

The  other  parameter  which  must  be  determined  is  ve,  the  energy  transfer  col- 
lisional  frequency.  The  first  calculations  with  this  parameter  used  the  value  of 
2m/M  =10  , i.e.,  the  ratio  of  electron  mass  and  neutral  gas  mass.  This  should 

adequately  model  the  transfer  of  kinetic  energy.  It  was  soon  found  to  be  a poor  approxi- 
mation and  led  to  excessively  high  values  of  U with  the  consequence  that  extremely  high 
ionization  rates  were  introduced  in  th^  course  of  the  computation.  The  cure  for  this  is 
found  when  it  is  realized  thac  the  air  molecules  can  absorb  energy  from  the  electron 
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in  the  form  of  rotational  and  vibrational  excitation  as  well  as  kinetically.  Such  proces- 
ses are  dependent  on  the  electron  energy  and  therefore  may  be  expected  to  depend 
on  U.  The  data  in  Ref.  12  shows  a dependence  for  on  E/p  which  is  roughly  given  by 


v 

— = 1.17x10' 

V 


3 


r 

for  2 < — < 20  V/(cm-  Torr).  If  we  extrapolate  this  to  much  larger  values  of  E/p  it 
must  obviously  fail  when  the  ratio  v/v  exceeds  unity.  To  avoid  this,  we  have  assumed 
that  v^/v  saturates  for  large  E/p  to  a value  of  1x10  . With  thiG  rather  arbitrary  choice, 

we  have  elected  to  use 


in  our  later  calculations.  A limited  check  for  values  of  E/p  below  30  (the  breakdown 
value)  is  provided  by  calculating  U and  comparing  to  the  values  given  in  Reference  12. 
Thi3  comparison  is  made  in  Table  II,  which  also  includes  the  value  obtained  from  the 

TABLE  II.  Comparison  of  average  energy  calculations. 


■ — l 

E/p  V/ (cm-  Torr) 

UCHS 
Ref.  16 

— 

UHZ 
Ref.  15 

UCALC 

15 

p 

5 

40.  3 

37.  2 

33.2 

10 

47.7 

86.0 

51.2 

47.2 

1 

53.0 

108.5 

61.6 

58.0 

20 

59.5 

128.0 

- . 

70.4 

— 

67.2 



expression  relating  U to  E/p.  The  agreement  is  only  fair  between  U_uc  and  U„.  _. 

More  significant  is  the  consistency  of  UCALC  and  15  (E/p)x/  . These  values  were  in 
drastic  disagreement  when  vft/v  was  approximated  as  2m/M.  Also  note  the  discrepancy 
between  the  results  of  References  11  and  12. 
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KINETIC  THEORY  TREATMENT  OF  METAL  EVAPORATION  FRONT 
M.  Newstein,  N.  Solimene  and  J.  Hammer 

We  are  concerned  with  the  evaporation  of  metal  vapor  from  a surface  exposed  to 
laser  radiation.  It  is  assumed  that  the  vapor  density  is  small  enough  that  the  laser 
energy  is  absorbed  at  the  metal  surface  rather  than  in  the  vapor.  There  is  a transition 
layer  extending  several  mean  free  paths  from  the  metal  surface  in  which  conditions 
deviate  significantly  from  local  equilibrium.  The  rate  of  evaporation  depends  on  the 
hydrodynamic  parameters,  density,  mean  velocity  and  temperature,  beyond  this  transi- 
tion layer.  The  relationship  between  these  parameters  and  the  temperature  of  the 

1 2 

metal  surface  has  been  discussed  according  to  various  models. 

Anisimov^  assumed  a form  for  the  Boltzmann  distribution  function  in  the  region 
of  the  evaporation  front  which  contained  features  of  the  physics  of  the  evaporation 
process.  Using  this  he  was  able  to  relate  the  conditions  at  the  surface  of  the  metal 
to  those  on  a plane  outside  the  transition  layer.  He  concluded  that  the  flow  of  atoms 
condensed  back  on  the  surface  of  the  metal  amounts  to  18%  of  the  flow  of  the  vaporized 
atom  s . 

In  this  paper  we  study  the  structure  of  the  evaporation  front  using  a method  which 

3 

has  been  useful  for  determining  shock  wave  structure.  Less  restrictive  assumptions 
are  made  than  those  of  Anisimov,  hence  we  are  required  to  solve  the  Boltzmann  equa- 
tion for  the  distribution  function  which  evolves  from  the  surface  distribution.  In  the 

4 

relaxation  time  approximation,  and  under  the  assumption  of  no  spatial  variation 
transverse  to  the  x-axis,  the  Boltzmann  equation  is  written: 


8f  ± 8f  1 « 

5F  + vx  = ? (+  - 0 


(1) 


where  4'/T  and  f/ T are  the  collisional  gain  and  loss  terms.  We  assume  that  the  collision 
time  T is  a function  of  the  moments  of  the  distribution  function,  and  that  the  function  4< 
toward  which  the  distribution  function  relaxes  has  the  form: 

V. .)  = "<*.  0 [-rsrtfhrF m 

where  the  density,  n,  mean  velocity,  u,  and  temperature,  T,  are  determined  by  the 
appropriate  velocity  integrals  of  the  distribution  function,  f: 

n(x,  t)  = JJJ d3v  f(x,  v,  t) 


n(x,  t)  u(x,  t)  = ///d3v  vxf(x,  v,  t) 


(3a) 

(3b) 
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3n(x,  t)  kT(rc!  t}  = ///d3v{[vx-u]2  + v2  + v2}  f(x,  v t) 


The  form  of  Eq.  (2)  together  with  Eq.  (3)  guarantees  that  the  relaxation  term  of  Eq.  (1) 
describes  two  body  collisions  which  conserve  energy,  momentum  and  particle  number. 
The  spatial  domain  of  interest  extends  from  the  metal  surface,  x=  0,  at  which  the 
time-varying  temperature  is  TQ(t),  to  a plane  x=i,  beyond  which  conditions  of  local 
equilibrium  obtain.  The  corresponding  boundary  conditions  are: 


f(x  — 0,  v,  t)  = nQ(t)  £ 


2rtkT 


f(x=  i , V,  t)  = 4 , V,  t) 


+ R f{x=  0,  -v  , v , v ) 
x y z 

for  v > 0 
x 


for  v < 0 
x 


Equation  (1),  subject  to  the  boundary  conditions  Eq.  (4)  and  the  initial  condition  f(t=  0) 
= 0,  may  be  integrated  and  the  solution  expressed  as 


f(x,  v,  t)  = f(x=  0,  v,  tr(x))  e 


" vxt(x'.  tr(x  -x') 


rx  dx>  ,,  , -* ' Jx-  vxT(x”*  tr<x -x") 

+ J v t(x',  tr(x-x'»  ^ ’ V’  tr<x-x,»e 


for  v > 0 
x 


f(x,  V,  t)  = f(x=  i , V,  tr(x))  e 


I dx1 

vxT(x' , tr(x’  - x) 


,*  dx-  . , . - JX-  vxT(x-,  tr(x"  - x)) 

+ Jx  vxTTx~'  t?rx''-~x»  *(X  > v’  tr(x*  - x))  e 

for  v <0 
x 


where 
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tr(x)  = t - — 
v 

x 


(5c) 


and  where  the  function  f and  ^ on  the  right  hand  side  of  Eq,  (5)  vanish  when  their  third 
argument  becomes  negative.  The  density  nQ(t)  and  surface  temperature  T Q(t)  are 
determined  by  the  net  energy  flux  across  the  metal  surface  and  the  equilibrium  vapor 
pressure.  Thus,  assuming  that  all  the  absorption  occurs  at  the  metal  surface,  we 
have: 


(6) 


where  c and  K are  the  thermal  diffusivity  and  conductivity  of  the  metal  and  JTis  the 
net  energy  flux  across  the  surface  given  by: 


7=  g(t)  - Jd3  vvx[v^  + vy  + vz  ] f(x  = °*  v,  t)  + nuv] 


(7) 


In  Equation  (7),  q(t)  is  the  absorbed  laser  energy  flux  and  V is  the  energy  of  vaporiza- 
tion/per atom.  The  equilibrium  density  nQ  can  be  expressed  in  terms  of  the  tempera- 
ture by  an  expression  of  the  form 


A 

no  = Te 


V 

kT 


(8) 


where  A is  an  empirically  determined  constant. 

The  evaluation  of  these  coupled  equations  for  realistic  situations  required  the 
use  of  numerical  methods.  The  interests  of  speed  and  flexibility  would  be  served  if 
the  dependence  on  the  transverse  velocity  coordinates  could  be  integrated  out  analytically. 
This  cannot  be  done  exactly,  and  the  results  of  one  approximate  procedure  will  be 
described  here.  Integrating  Eq.  (1)  over  the  transverse  coordinate  v^  and  v^  and 
defining  the  reduced  distribution  functions  by: 


F(x,  vx,  t)  = ffdv  dvz  f(x,  v,  t) 


(9a) 
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3F  9F  lr,  rl 

W + VX  = 7 c*  - F] 


The  temperature  T,  which  appears  in  'S',  cannot  be  found  from  the  reduced  distribution 
function  F,  rather  we  have 


3T  = T^  + 2Tt 


where 


^ 1 i , 7 

n = / dv  (v  - u)  F 

m «/  x x 


n "ST  = ///d3vvvf  = fJfd3v  vzl 


Thus,  to  close  our  reduced  equations,  we  require  another  equation  (and  boundary 

conditions)  for  the  transverse  temperature  T . This  may  be  found  from  Eq.  (1)  by 

2 1 
multiplying  v and  integrating.  A further  approximation 


f f f v v2f  d2v  » nu , 

JJJ  x y m 


(i.e.  the  neglect  of  the  longitudinal  conductivity  of  heat  associated  with  the  transverse 
thermal  motion)  yields  an  approximate  equation  for  T : 

5T"'rtt!rn"T,  = TT  <VTt>  • <14> 

The  boundary  condition  used  is  Tt(0)  = Tq.  The  results  for  a calculation  run  to  the 
steady  state  are  shown  in  Figure  (1)  and  (2).  Figure  (1)  illustrates  the  dependence  of 
the  steady  state  reduced  distribution  function  on  v^  and  x.  At  the  input  plane  x=  0,  the 
function  for  positive  v^  has  the  form  of  a Maxwellian  at  the  surface  temperature.  For 
negative  values  of  v^  the  surface  distribution  function  is  determined  by  the  molecules 
scattered  from  planes  with  x > 0.  As  the  coordinate  x increases  from  x=0,  the  distri- 
bution function  tends  toward  the  local  equilibrium  form  of  a Gaussian  centered  at  the 
mean  velocity  u(x).  Figure  2 illustrates  the  steady  state  dependence  of  the  moments 
n,  u,  nu  c e and  c on  x,  where 


ce  'v  3 


Ct  =^J 


can  be  identified  with  the  longitudinal  and  transverse  sound  velocities  associated  with 
the  corresponding  temperatures.  The  clearest  indication  of  the  deviation  from  conditions 
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Fig.  1.  This  figure  illustrates  the  steady  state  distribution 
function  f(x,  vx)  as  a function  of  velocity  vx>  and 
distance  from  the  metal  surface,  x.  The  temperature 
of  the  metal  surface  TQ  would,  under  equilibrium 
conditions,  lead  to  a number  density  nQ  and  sound 
velocity  C0  = *Jk T0/m  Y . These  quantities  are  used 
to  normalize  the  variable  f and  vx.  The  collision 
frequency  is  taken  to  be  given  by  1/t  = nco  and  the 
distance  variable  x is  normalized  relative  to  the 
equilibrium  mean  free  path  f0  = l/nQo.  At  the 
surface  x = 0 the  distribution  function  is  Maxwellian 
for  positive  values  of  vx  for  negative  values  it 
assumes  a form  determined  by  the  effect  of  collisions 
to  the  right  of  the  x=  0 plane.  As  x increases,  the 
distribution  function  assumes  the  local  equilibrium 
form. 

of  local  thermal  equilibrium  is  provided  by  the  difference  between  c^  and  c^.  Beyond 

a plane  about  4.5  mean  free  paths  from  the  surface  the  values  of  c^,  c{  come  together. 

This  is  the  plane  beyong  which  a hydrodynamic  description  becomes  valid. 

National  Science  Foundation 

ENG76-21829A01  M.  Newstein,  N.  Solimene  and  J.  Hammer 


Fig.  2.  Illustrates  the  kinetic  equation  solutions  lor  the 
steady-state  spatial  dependence  of  longitudinal 
and  transverse  temperature  T^/T0=  (c^/cq), 

Tt/T0  = (Ci/CQ)^>  mean  velocity  u,  and  number 
density  n.  Our  model  of  collisions  causes  relaxa- 
tion toward  one  temperature  T = 2/3  Tc  + l/3  Tj  . 

In  the  region  where  ct/ce  condition  of  local  equilibrium 
do  not  exist  and  the  hydrodynamic  equations  are 
inapplicable.  For  values  of  x greater  than  several 
mean  free  paths  the  hydrodynamic  equations  may  be 
used  to  obtain  the  relevant  moments  of  the  distribution 
function  with  boundary  conditions  determined  by  the 
kinetic  equation  solutions. 
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FUNCTIONAL  EQUATION  APPROACH  TO  THE  THREE- WAVE  NONLINEAR  INTER- 
ACTION 

S.  T.  Peng  and  E.S.  Cassedy 

The  resonant  (parametric)  three-wave  interaction  has  been  studied  in  the  nonlin- 
ear regime  as  either  the  initial- value  problem*’  ^ or  the  boundary- value  problem.*  ^ 

In  the  initial- value  problem*’  ^ pulse  envelopes  of  propagating  waves  overlap  in  an  un- 
bounded space  for  finite  intervals  of  time  either  due  to  contra-directional  propagation 

or  due  to  overtaking  (co-directional  waves  of  different  group  velocities).  The  boundary- 
1-3 

value  problem,  on  the  other  hand,  deals  with  the  interaction  of  three-waves  in  a 
bounded  region  of  interaction,  with  one  wave  (the  pump)  specified  to  have  a fixed  value 
at  one  boundary  attributed  to  a wave,  of  constant  amplitude,  incident  from  the  external 
region.  It  is  for  the  initial- value  problem  that  we  shall  show  the  application  of  the  in- 
tegral equation  approach. 

The  three- wave  problem  is  one  of  several  nonlinear  wave  problems  of  interest  in 

laser  optics  or  laser  plasma  interactions.  The  other  problems  include  the  solution  of 

5 6 m 

nonlinear  wave  equations  ' (e.g.,  the  KdV  equation  or  the  nonlinear  Schrodinger 

equation),  several  of  which  display  soliton  characteristics.  Thus  far  in  the  literature 

on  nonlinear  waves  the  predominant  analytical  approach  to  solutions  has  been  through 

5 

the  Inverse  Scattering  Transform  ’ (1ST),  and  numerical  solutions  have  typically  been 

obtained  through  differential  techniques.  The  Inverse  Scattering  Transform  approach 
is  a powerful  method  but  requires  an  auxiliary  formalism  and  multiple  steps  to  achieve 
the  wave  solution  and  differential  techniques  often  encounter  problems  of  numerical 
stability.  We  have  therefore  explored  the  nonlinear  integral  equation  as  an  alternative 
approach,  both  analytical  and  numerical. 

We  have  considered  the  system  of  first-order  nonlinear  partial  differential  equa- 
tions for  the  three- wave  interaction: 


(- 

1 at 


a . „ * * 

+ a.  + V.  -r— ) a.  = p.  a.  a. 

1 1 3 x ' 1 ri  j k 


(1) 


for 

i,  j,k  = 1, 2,  3 and  i ^ j ^ k 

where  a.  is  the  complex  wave  amplitude,  p.  the  nonlinear  coupling  coefficient,  cr.  the 

damping  constant  and  v.  the  wave  velocity  of  the  i-th  mode.  Such  a system  of  coupled 

equations  has  been  extensively  analyzed  in  the  literature  by  various  approximation  tech- 
7 2 7 

niques  or  for  special  cases.  ’ Here  we  show  the  approach  to  this  system  of  nonlinear 
differential  equations  by  the  method  of  integral  equations. 


We  observe  that  the  system  of  nonlinear  differential  equations  in  Eq.  (1)  are 
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coupled  through  the  nonlinear  term  in  each  equation.  If  the  nonlinear  term  is  viewed  as 
a known  source  of  excitation,  the  three  equations  in  Eq.  (1)  then  appear  linear  and  in- 
dependent of  one  another,  and  the  solutions  can  therefore  readily  be  written  as: 

-o\t  t -o\(t  - t) 

a.  (x,  t)  = e 1 f.(x  ~ v.t)  + (3.  jB  (x-v.t  + v.T,  T)e  1 dT  (2) 

i ' i i ri  J tk  l l ' 

n J 


Bjk(x.  t)  = aj(x,  t)  ak(x,t)  (3) 

where  T is  a given  function  that  characterizes  the  initial  condition  of  the  i-th  mode. 
However,  with  the  source  functions  B.,  given  by  Eq.  (3),  Eq.  (2)  actually  forms  a sys- 
tern  of  nonlinear  integral  equations  from  which  the  wave  amplitudes  a^  still  remain  to 
be  determined.  In  comparison  with  the  original  differential  equations,  the  integral 
equations  have  the  advantage  that  the  initial  conditions  appear  explicitly  in  the  equations; 
therefore,  the  integral  equations  are  particularly  convenient  for  the  study  of  evolutions 
of  wave  packets  from  a given  set  of  initial  conditions. 

The  equations  in  Eq . (2)  contain  two  terms  which  may  be  interpreted  as  follows: 
The  first  term  is  the  propagation  of  the  initial  wave  profile  in  the  absence  of  the  non- 
linearity in  the  system.  The  second  term  accounts  for  the  nonlinear  coupling  effect. 
Furthermore,  the  integral  characterizing  nonlinear  interactions  means  physically  that 

the  field  of  the  i-th  mode  at  time  t depends  on  the  past  history  of  the  other  two  modes. 

-CTj  ft-  t) 

In  the  presence  of  damping,  the  effect  of  the  past  history  is  damped  by  a factor  e 

for  the  i-th  mode  for  0 < t < t.  Evidently,  in  the  special  case  of  heavily  damped  i-th 
2 

mode  , » 1,  the  first  term  in  Eq.  (2)  is  exponentially  small  and  the  integrand  in  the 

second  term  may  have  significant  contribution  only  in  the  vicinity:  r = t.  Therefore, 
Eq.  (2)  may  be  approximated  by: 


a.(x,t)=  (3.  a.  (x,  t)  ak(x,t)  a.  (4) 

which  means  physically  that  the  i-th  mode  depends  on  the  instantaneous  interaction  be- 
tween the  other  two  modes  at  every  point  (x)  in  space.  The  same  result  was  obtained 

2 

by  neglecting  the  derivative  terms  in  the  differential  equations. 


Next,  in  order  to  illustrate  the  integral  equation  formulation,  we  consider  the 
case  where  a^  is  the  heavily  damped  mode  (i.e.,  » 1).  Substituting  from  Eq.  (4) 

for  a^,  we  obtain  for  i = 2 and  3 the  following: 


( 


_3 

at 


cr.  + v. 

l l 


a. 

l 


(5) 


for  i,j  = 2 and  3;  i ^ j . 
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Multiplying  by  a^  , Eq.  (5)  becomes: 

(4--  2<r.  + v.  ~)I.  = 2p.  I.  I.  (6) 

'9tii9x,iriij 

I.  = |ai|2  and  p.  = (7> 

where  L (x,  t)  is  the  intensity  of  the  i-th  mode  and  is  a non-negative  real  function  of  x 
and  t.  Similarly,  the  differential  equation,  Eq.  (6),  may  be  converted  into  an  integral 
equation  and  the  result  in  this  case  is: 

A 

-Za.t  2 p.  / I.(x  - v.t  + v.t,  r)dT 

!/.«*.  ».  11  „ tt  ta\ 


I.(x,  t)  = e 


g.(x-  v.t) 


for  i,j  = 2 and  3,  and  i ^ j. 


where  g^  characterizes  the  initial  intensity  of  the  i-th  mode.  Since  L is  non-negative, 
ihe  integral  in  the  exponent  of  Eq.  (8)  is  a non-decreasing  function  of  t.  Therefore,  I. 
will  be  exponentially  decreasing  in  time,  if  cr^  / 0 and  p^  > 0.  On  the  other  hand,  if 
/ 0 and  p^  > 0,  L may  be  increasing  or  decreasing  in  time,  depending  on  the  initial 
conditions . 

Equations  (2)  and  (8)  are  nonlinear  integral  equations  from  which  the  amplitude 
(or  intensity)  of  the  i-th  mode  is  to  be  determined.  Although  such  integral  equations 
have  been  shown  here  to  be  useful  for  the  study  of  general  nonlinear  interactions  among 
the  three  waves,  they  are,  however,  not  amenable  to  direct  exact  solutions.  For  a 
numerical  analysis  of  the  nonlinear  wave  evolution,  it  is  desirable  to  approximate  the 
nonlinear  integral  equation,  for  incremental  steps  in  time. 

Treating  the  heavily-damped  case,  we  observe  that  in  Eq.  (8),  the  integral  in  the 
exponent  may  be  approximated,  for  t = t^  « 1,  to  yield: 


- 2<r  .t 

I.(x,  tj)  = e g^x-  v.t^ 


p [l  (x,t  ) + I :(*-  v -t.,  0)1 
t.)e  1 J 1 J 1 


which  is  a nonlinear  algebraic  equation  to  solve  for  T (x,  t)  in  terms  of  the  initial  condi- 
tions L(x- Vjt,  0)  (at  t = 0 but  with  a shifted  coordinate  (x- wt^)).  In  principle,  this  non- 
linear equation  can  be  solved  for  any  given  nonlinear  coupling  coefficients,  p^;  for 
p^  « 1,  Eq.  (9)  can  be  further  approximated  as: 

-2o\t.  -2<r.t.  f I.(x  - v.t. , 0) 

Ijfx,  tjJ-p.e  1 g.(x- v.- tj)  T(x,  tt)  = e g.  (x  - v.tj)  t ‘ J 1 (10) 

for  i,  j =1,2;  i ^ j 

which  forms  a system  of  coupled  linear  equations  to  be  solved  for  L and  T at  t =t^,  in 
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terms  of  L and  T at  t = 0.  The  solutions  are: 


- 2a . t. 

I^x,  tj)  = e g^x-v.^) 


piIi(x " Viti*  °)  '2Vi 
5 J *pie  «i(x-vitl)e 


pl(x-v.tl.0) 
.t. ) e J 


- i.(a  . + o . )tj 

1 ■ PjPj  e * J 8j(x-  vi4l)  " VjV 


for  i,  j = l and  2;  i / j 

The  last  equation  describes  a nonlinear  mapping  of  a set  of  initial  field  distributions  at 
t = 0 into  another  set  at  t = tj.  The  computations  of  such  a mapping  are  straightforward. 
After  I.(x,tj)  and  I.(x,tj)  are  computed,  they  can  be  used  as  a new  set  of  initial  condi- 
tions for  computations  of  the  field  distributions  at  t = t, . This  process  can  be  repeated 
as  the  algorithm  for  numerical  computations  of  the  initial- value  problem  for  given 
initial  profiles  of  the  waves. 

Using  this  algorithm,  we  have  made  calculations  for  several  examples  of  the 
three-wave  interaction  (heavily- damped  case).  One  set  corresponds  to  the  Gaussian-pulse 
parameters  used  by  Chu  and  Karney,  with  results  in  close  agreement  with  theirs  (done 
by  differential  techniques).  The  results  were  obtained  using  the  MRI  PD P- 11/40  Compu- 
tational Facility  and  are  shown  in  Figs.  1 to  3 using  its  "three-dimensional"  display 
capability.  The  two  interacting  Gaussian  pulses  are  labelled  as:  1^  and  1^  with  the  am- 
plitudes Pj  and  P^,  the  half  width  Wj  and  and  initially  at  the  positions  Xj  and  x^, 
respectively.  The  numerical  results  may  be  interpreted  as  follows: 

• mpt-  1 000009-03  Ul-  1 000009-01  U2-  5 000009-02 

XI-  2 000009-01  X2-  2 000009-01 
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Fig.  1 (parameters  - - = 1.0,  = 10  \ W^  = 10  * , W^  = 0.5x10 

Xj  = 2x10  x^  = 2 x 10  *): 

Here  the  pulse  is  too  small  to  be  seen  in  the  first  two  t;me  frames,  but  is  in- 
cident from  the  right  from  an  initial  spatially-  separated  position.  The  I ^ pulse  has 
achieved  sufficient  gain  by  the  third  frame  to  be  discernible  (the  dotted  profile).  By  the 
time  of  the  fourth  time  frame  the  pulse  has  propagated  clear  of  the  Ij  pulse,  but  dur- 
ing the  intervening  time  the  remaining  overlap  (interaction)  has  resulted  in  further  gain 
of  the  profile.  Thereafter,  in  frames  5 and  6,  both  pulses  separate  in  opposite  direc 
tions  with  no  further  (perceptible)  change  of  shape  or  amplitude.  The  total  depletion  of 
Ij  by  this  interaction  is  small. 

•mpl*  t 00000«-02  Ut-  1 «M««t-01  U2*  s «c 

xt»  a 00000»-0i  xa-  a 


Fig.  2 (parameters  --  same  as  Fig.  1,  except  P^  = 10  ^): 

Here  the  initial  1^  is  ten  times  larger  than  in  Figure  1.  As  a result,  the  growth 
of  1^  is  sufficiently  much  larger  by  the  third  time  frame  to  cause  a significant  "notch" 
of  depletion  in  the  Ij  envelope.  By  the  time  of  the  fourth  frame,  this  depletion  has 
evidently  continued  so  as  to  deplete  the  entire  trailing  (left-hand)  edge  of  Ij.  The  am- 
plitude of  I^,  of  course,  has  also  gained  considerably  more  in  the  time  lapse  between 
frames  2 and  3,  such  that  1^  is  nearly  three  times  the  amplitude  of  1^.  These  amplitudes 
and  shapes  again  remain  unchanged  in  th«  fifth  and  sixth  frames  due  to  no  further  signi- 
ficant interaction,  as  the  pulses  continue  to  separate. 


*S?:  T 
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smpl-  1 000000-01  Ul-  1 000000-01  U2-  S Mew»-e2 

XI-  2 000000-0 1 X2-  2 000000-01 


* J 
' 1 


Fig.  3 (parameters  --  same  as  Fig.  1,  except  P^,  = 10  * ): 

Here  the  initial  is  ten  times  the  previous  (Fig.  Z)  and  as  a result  the  depletion 
of  Ij  by  1^  is  even  sharper  in  the  third  frame,  leaving  an  even  smaller  Ij  in  the  subse- 
quent time  frames.  (The  profiles  of  1^  have  been  sketched  in  with  supplementary  dashed 
lines  for  added  clarity.)  The  final  1^,  profile  peak  is  about  ten  times  the  peak  amplitude 
of  Ij  for  this  case. 

8 

Further  numerical  results  and  analytical  considerations  have  been  carried  out 
for  the  highly-damped  case.  Numerical  results  show  full  depletion  and  saturation  ef- 
fects expected  for  the  model  in  the  nonlinear  regime.  Application  of  the  functional  ap- 
proach is  recommended  for  other  cases. 
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ANALYSIS  OF  STRONG  ELECTROMAGNETICALLY  INDUCED  SPHERICALLY 
IMPLODING  SHOCKS 

Y.  Fujimoto  and  E.A.  Mishkin 

A.  Introduction 

Spherical  shock  waves  present  solutions  of  the  mass,  momentum  and  energy  con- 
tinuity equations  (I),*  which  are  discontinuous  at  the  shock  front.  They  may  move  away 
from  the  center,  either  due  to  an  increased  pressure  in  the  central  domain  of  a star, 
or  to  a sudden  release  of  a finite  energy  E at  the  center  of  the  disturbance,  or  be  center 
bound- imploding  shocks  that  compress  substantially  the  sphere  of  matter  they  bear 
upon.  The  latter  can  be  induced  by  a converging  "spherical  piston"  realized,  for  exam- 
ple, by  ablated  matter  under  the  impact  of  a strong  electromagnetic  energy  pulse.  The 
increased  interest  in  implosion  is  due  mainly  to  the  possibility  of  compressing  matter 
to  densities  exceeding,  by  several  orders  of  magnitude,  that  of  solids  and  studying 
matter  at  very  high  temperatures  and  pressures.  Also,  highly  compressed  matter 
shortens,  by  several  orders  of  magnitude,  the  mean  free  path  of  the  very  energetic  a- 
particles  produced  in  a thermonuclear  reaction  rendering  it  self- sustained  and,  there- 
fore, it  is  viewed  as  a necessary  step  in  realizing  controlled  thermonuclear  fusion. 

B.  Conservation  Equations.  Self-Similar  Solution 

Spherical  shock  waves  follow  the  partial  differential  mass,  momentum  and  energy 
conservation  equations 


d^p  + p9  u + — p u = 
tr  r r 

du  + — 9 p - 0 
t p rK 

dt(P  P V)  =0 


dt  = 9t  + u 9 


(1) 


p,  p and  u denote  the  pressure,  density  and  velocity,  respectively,  of  the  ideal  gas  with 
constant  specific  heats,  y = cp/cv»  through  which  the  shock  wave  is  moving. 


We  denote  the  position  of  the  shock  front  by  Rg  and  measure  the  distance  r of 
some  arbitrary  point  behind  it  in  units  of  Rg,  4 = r/Rg-  The  imploding  shock  extends 
over  1 < 4 < oo.  The  domain  0 < 4 < 1.  corresponds  wave  that  originated  at 

the  center.  Neglecting  the  molecular  structure  of  the  ideal  gas,  the  lack  of  any  char- 
acteristic length  suggests  the  self- similar  solution,’ 


p(r,t)  = po  R*  P(4);  p(r,  t)  = p /f(4 );  u(r,t)  = R.  Uj(4)  , U> 

of  the  conservation  equations  (l).  P(4)>  ^(4)  and  Uj(4)  are  the  reduced,  or  non-dimen- 

sional pressure,  density  and  velocity,  respectively. 
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The  resultant  ordinary  differential  equations  of  the  reduced  functions  are  greatly 
simplified  by  the  velocity  transformation^ 

ut(e)  - Uj(£)  = u(t)  +e  . o) 

■U  is  the  speed  of  sound  at  the  point  £ behind  the  shock  front  where  the  pressure  is 
^ m r 

maximum.  With  constant,  expressions  (2)  together  with  the  transformation  (3) 
lead  to  the  set  of  ordinary  different. al  equations  in  P,  f and  U, 

- P 1 d|  = U_1d£  U + 24'1  + 3U'1  , 

~f°~  1d^P  = Ud^U  + \ £ +(\+l)U  , (4) 

-P'ld^  P = YU‘ld^  U + 2 4*1  + (3Y  + ZXJU'1  , 

where  the  self- similar  coefficient, 
dfnR 

v = THTT  = const-  (5) 

S 


The  shock  front  at  time  t is  at, 

r,(.i  = R„(i ; tAe)“ . «M-— Jjry  • 

Using  the  notation 

a -i 

cr(4)  = exp (-/  U ‘di  ),  a(l)  = 1 ; 

I 5 

= -U'K(i)  , 

we  obtain  the  integral  forms  of  P and 

/>($)  = [-e2U(6)]'la3«){ 

p<4)  = 2-(v~1jv1  [-e2u(4)]'v  *3v+2x(4)  • 

(V  +1)Y 

Eliminating  d^  U,  Eqs.  (4)  leads  to 

(C2-  u2)^  = 2Y(|i)2  + (2V  + 2\-  Y\)^-  Y\ 


(6) 


(V 


(8) 


(9) 


where. 


r 
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c2<e>  = Y ; c(r,t)  = RSC(4) 

C(4)  is  the  reduced  speed  of  sound. 

C.  Some  Physical  Considerations  Concerning  Spherical  Shocks 

The  exploding  and  imploding  spherical  shock  waves  are  differently  generated  and 
the  underlying  physical  principles  must  reflect  it.  In  the  first  case,  a sudden  release 
of  a finite  energy  E,  at  the  center,  causes  an  outwardly  emanating  spherical  shock 
wave.  Not  too  far  from  the  point  of  explosion  the  shock  is  still  strong  and  the  pressure 
which  the  shock  wave  encounters  can  be  neglected.  The  energy  of  the  mass  of  gas  set 
in  motion  behind  the  shock  front  then  is  conserved  or  time  dependent.**  The  total  energy 
of  the  gas  behind  the  strong  shock. 


E = 4t r / 


Rs(f) 


, P x 1 2.  2 . 

( — — r tTpu  ) r d 
y - 1 2 r ' r 


(11) 


= 4 IT 


p Ro  "V  (!*  t-)5  a'  2 / trri  p(^  + i /<£,(ui+  S)1  d£  . 

ro  t c o T 

c 


is  time  independent  when, 
or  =0.4 


(12) 


The  slopes  of  the  reduced  pressure,  right  behind  the  shock  front,  follow  directly 

Eq.  (9), 

2y(v-  i) 


d,  P(l)  = -4^-i  (\-  \ ) ; X = - . x ....  .. 

5 Y2-  1 ° ° (v  + 1,(2-y~ 

d^  P>  0,  when  X < \q  , 


(13) 


(14) 


It  is  shown  in  Ref.  3 that  X must  be  smaller  than,  or  equal  to  X . In  the  wake  of 

o 

the  shock  the  pressure  P dies  out.  The  pressure  P then  must  reach  a maximum,  P 

nn 

somewhere  behind  the  front  of  the  imploding  shock  wave.  This  pressure  may  in- 
crease slightly,  reaching  a maximum  before  it  dies  out  at  the  tail  of  the  shock.  Detail- 
ed considerations  show  that  this  pressure  maximum  is  very  small  and  occurs  very 
close  to  the  shock  front,  see  Figure  1.  They  also  show  that  for  y > 2 + ■Jl,  the  pressure 

maximum  P occurs  right  at  the  shock  front, 
m 

D.  The  Postulate  of  Analyticity  by  Gelfand^  and  Butler^* 

The  shock  front  represents  a discontinuity.  Behind  it  the  pressure,  the  density, 
and  the  velocity  are  differentiable,  well-behaved  functions.  This  request  for  analytici- 
ty coincides,  or  is  congruent,  with  the  requirements  of  maximum  pressure  behind  the 


i 
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Fig.  1.  The  reduced  pressure  P(£). 


shock  front.  A spherically  meaningful  solution,  X = X(-y),  a - 1 / 1 - X,  must  be  single 
valued  and  consider  the  maximum  of  the  reduced  pressure  curve  P(£).  In  1952,  I.M. 
Gelfand  showed  that  the  function  X(-y)  spans  a whole  interval  of  values  and  "proposed  to 
choose  the  exponent  on  the  principle  of  analyticity  ? " D.S.  Butler^  found  X(-y)  with  the 
aid  of  a high  speed  digital  computer,  he  chose  as  he  writes  ".  . . . the  one  non- singular 
solution  of  a system  of  nonlinear  differential  equations^* 


Equations  (4)  lead  directly  to  the  following  equation  in  a, 

/ 7 , Ud,  U + \i  + (X  + 1)U 

2 U)'Y  3v  + 2v~3—  :r  -£ n n 

(V+1)Y  yU  d,U+2Y£  +(3y+2X)U 


(15) 


The  l.h.s.  of  the  last  equation  is  finite  and  at  the  singularity,  when  the  denominator  of 
the  fraction  on  its  r.h.s.  vanishes,  its  numerator  must  vanish  too.  Considering  again 
Eqs.  (4)  we  obtain, 

(fil-  KP*1)^  P = 0 (16) 

At  the  singularity  then  the  reduced  pressure  curve  P(£)  reaches  maximum, 

d^  P = 0 . (17) 

E.  The  Phase- Plane  U/S  =y,  dU/d£  = x of  the  Self- Similar  Solution 


JU<J  WAVE- MATTER  INTERACTIONS 

the  phase  plane  r,  r often  used  in  the  solution  of  nonlinear  problems.  The  motion  of  the 
system  behind  the  shock  front  now  is  portrayed  by  the  curve  extending  from  the  point 
U(l),  d^  U(l),  representing  the  state  of  the  system  right  behind  the  shock  front,  to  the 
shock  tail  at, 

lim  jr  = lxn  d#  U = -1  , (18) 

I - oo  5 5 

see  Figure  2.  The  C^(£)  - U^(£)  curve,  see  Fig.  3,  also  is  mapped  onto  the  same  curve. 
The  conservation  Eq.  (9)  now  reads. 


2 2 “t  ^ 2 

(C  - U ) -j^pT  = 2\  y + (2y  + 2k  - y )y  - y \ . 
When  the  self- similarity  coefficient  k equals, 


X “ = < 0 


The  discriminant  A = 0 and  Eq.  (19)  reads 
2 2 ^ 2 

<c  - U )-£p-  = 2Y(y-  ym)  . (21) 

Around  ym>  the  r.h.s.  of  Eq.  (21)  is  an  even  function  in  y.  In  the  interval, 

1 < £ < £ m»  d^  P and  C - U , are  positive.  Beyond  £ m they  are  both  negative, 

see  Figures  1 and  3.  d^  P and  C - U on  the  l.h.  s.  of  this  equation  vanish  simultaneous- 
ly at  ym,  or  £ m> 

dSP|in,»'"’c!«J-“2(Sml'0  . (22) 

At  y = y , there  is  a double  zero  on  both  sides  of  Equation  (19). 

This  proves  the  uniqueness  of  the  \ = solution  for  1 < y < 2 +-f>,  for  a dis- 
criminant A can  be  neither  negative  (there  is  no  solution)  nor  positive.  In  the  latter 

case  there  are  two  zeros  on  the  r.h.s.  of  Eq.  (21)  and  three  on  its  l.h.  s. 

Table  I shows  the  analytically  obtained  value  of  the  self- similar  coefficient  a and 
those  hitherto  obtained  numerically. 
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TABLE  I.  The  sell- similar  exponent  a. 


y 1.0 

1.  1 

7/5 

5/3 

2+3  6 

10  20 

CD 

Analytical 
values  of  0.749 

Of 

0.  734 

.707 

.68  7 

.600  .562 

.537  .518 

. 500 

a 

computed  ^ 

numerically  1.0 

.786? 

7 7 1 , 6,  7 

.6886'7 

7 

.610 

. 3758 
. 5887 
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DEVELOPMENT  OF  THE  DESIGN  FOR  IRON-CORED  SYNCHRONOUSLY 
OPERATING  LINEAR  MOTORS 

E.  Levi 

A.  Introduction 

The  first  phase  in  the  U.S.  development  of  a linear  propulsion  system  for  high 
speed  ground  transportation  is  reaching  completion.  This  seems  to  be  an  appropriate 
time  to  review  the  role  played  by  the  Polytechnic  in  the  national  program. 

The  Polytechnic  project  was  initiated  in  1973,  after  the  Department  of  Transpor- 
tation had  already  made  two  important  choices  with  regard  to  linear  propulsion:  (1)  a 
system  in  which  the  energy  would  be  supplied  to  the  vehicle,  as  opposed  to  energization 
by  the  wayside,  (2)  a system  using  iron-cored  instead  of  superconducting  magnetic. 

A wheel-on-rail  linear  induction  motor  research  vehicle  (LIMRV)  had  already 
been  tested  and  a similarly  propelled  track  levitated  vehicle  (TLV)  was  under  construc- 
tion. * The  shortcomings  of  the  induction  motor,  with  regard  to  power  factor  and  ef- 
ficiency, and  the  interest  in  attractive  magnetic  levitation  prompted  the  search  for 
alternative  propulsion  schemes.  Favorable  consideration  was  given  to  the  synchro- 
nously operating  motors  proposed  by  the  Polytechnic, 

Consideration  was  restricted  to  magnetic  field  topologies  which  were  single- 
sided and  allowed  the  rail  track  to  be  completely  passive.  To  be  determined  was  the 
maximum  economical  clearance  between  the  energized  portion  of  the  motor,  carried 
by  the  vehicle,  and  the  rail  track.  The  specifications  called  for:  thrust=  10^N,  speed  = 
134  m/s,  power  = 13.  4MW,  powe  r factor  = 1 , efficiency  >.  9.  The  calculated  pe r - 
formance  parameters  of  the  TLV  LIM  served  as  benchmarks  for  comparison. 

B,  Project  Planning 

Iron-cored,  synchronously  operating  machines  with  single  side  excitation  have 
been  used  in  the  past,  mainly  as  high  frequency  generators  at  the  kW  power  level. 

The  present  task  then  involved  transition  from  rotating  to  linear  motion,  and  scaling - 
up  by  three  orders  of  magnitude  in  power.  This  amounted  to  the  development  of  a new 
machine.  In  this  case  a step  by  step  approach  led  to  minimal  risks  and  costs. 

The  first  step  involved  a comprehensive  study  of  a matrix  of  motor  types  and 
track  clearances.  Next,  the  two  most  promising  configurations  were  to  be  selected 
for  detailed  analysis.  This  would  reveal  the  critical  issues  to  be  addressed  in  the 
next  phase.  Final  designs  for  full  scale  prototypes  and  scaled-down  models  were  to 
follow.  As  a last  step,  before  construction  of  a full  scale  prototype,  the  accuracy  of 
the  performance  predictions  was  to  be  tested  on  the  scaled-down  model.  In  the  case 
of  linear  motors,  this  means  tests  on  a large  wheel  simulating  the  track. 
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C.  Selection  of  the  Most  Promising  Configurations 

The  simplest  synchronously  operating  machine  has  a d-c  excitation  on  one  side 
(field),  and  a traveling  wave  a-c  excitation  on  the  other  (armature).  Although  such  a 
machine  does  not  satisfy  the  requirement  of  a passive  rail  track,  it  has  the  simplest 
topology  and  leads  to  the  highest  power  density  (power  output  per  unit  weight).  Since, 
in  a transportation  system,  this  is  the  most  important  parameter,  a design  was  per- 
formed for  the  conventional  synchronous  machine  and  was  used  as  reference  base. 

Attention  was  focused  on  three  machine  types  in  which  both  field  and  armature 

windings  are  carried  by  the  vehicle  side.  These  were:  (1)  the  homopolar  inductor 

2 3 

first  proposed  by  Kemper  and  later  analyzed  by  Rummich  (Fig.  1),  the  claw-pole 

4 5 

or  Nadyne  first  proposed  by  Pierro  (Fig.  2),  and  the  heteropolar  inductor  machine 

(Figure  3). 

In  the  homopolar  inductor  machine,  the  field  winding  is  interlinked  with  a trans- 
verse d-c  flux;  the  armature  winding  is  divided  into  two  sections,  each  interlinked  with 
both  a longitudinal  a-c  flux  and  the  transverse  d-c  flux.  As  a result,  in  the  armatures 
the  magnetic  flux  density  B oscillates  around  an  average  value,  instead  of  alternating 
between  a positive  and  a negative  peak,  as  in  conventional  machines. 

The  same  occurs  in  the  heteropolar  inductor  machine,  even  though  both  field  and 
armature  fluxes  are  longitudinal. 

In  the  claw -pole  machine,  instead,  the  magnetic  flux  path  is  twisted  from  trans- 
verse to  longitudinal,  while  it  is  forced  to  traverse  two  additional  air  gaps. 

Two  clearance  lengths  (1.5  cm  and  3 cm)  were  considered.  The  heteropolar 
inductor  machine  carries  an  alternating  flux  through  the  rail  and  hence,  requires  a 
finely  laminated  rail  track.  Therefore,  it  was  not  deemed  suited  for  ground  rail 
transportation.  This  left  the  homopolar  inductor  and  the  claw-pole  types  as  sole 
contenders.  Motivation  for  pursuing  their  development  was  provided  by  the  fact  that 
both,  according  to  the  analysis,  outperformed  the  T LV  LIM.^ 

With  regard  to  the  homopolar  inductor  type  it  must  be  noted  that,  due  to  its 
asymmetry,  the  flux  swing  and,  hence,  the  output  is  severely  limited  by  iron  satura- 
tion effects.  This  limitation  is  compounded  by  the  fact  that  saturation  also  hinders 
the  variable  reluctance  effects  on  which  the  machine  relies  for  its  operation. 

On  the  other  hand,  in  the  claw-pole  machine,  the  additional  mmf  requirad  to 
overcome  the  double  air  gap  enhances  the  role  of  leakage  fluxes.  As  it  turns  out,  the 
weight  penalty  imposed  by  these  fluxes  limits  the  usefulness  of  the  machine  to  clearances 
well  below  1 cm.  For  this  reason,  further  development  was  concentrated  on  the  in- 
ductor motor. 
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A sensitivity  analysis  was  then  performed  to  evaluate  the  effects  of  rail  clearance, 
rail  geometry,  power  factor,  and  pole  face  length  on  the  power  density.  As  a result 
of  this  analysis  and  other  considerations  the  clearance  of  1.5  cm  was  selected. 

One  of  the  features  of  synchronously  operating  machines  is  that  the  three  com- 
ponents of  force,  thrust,  levitation,  and  guidance  can  be  measured  in  static  tests.  It 
was  thus  possible  to  check  the  analytical  predictions  against  actual  tests  made  on  in- 
expensive models.  It  was  found  that  the  one -dimensional  field  mapping  on  which  the 
calculations  were  based  failed  to  provide  adequate  accuracy.7  This,  in  ,,  articular, 
invalidated  the  performance  predictions  with  the  larger  clearance  (3  cm).  It  also  in- 
dicated one  of  the  critical  issues:  the  need  for  accurate  flux  mapping. 

D.  Flux  Mapping 

o 

Inductor  machines  were  studied  at  the  beginning  of  the  century  by  Wieseman. 

Using  graphical  flux  plotting  techniques,  he  determined  the  optimum  pole  arc  to  be 
Ct  = 60  . His  conclusions,  however,  were  based  on  poles  having  rectangular  profiles 
and  on  neglect  of  armature  excitation  and  saturation  effects.  The  variable  reluctance 
effects  and,  hence,  the  performance  can  be  enhanced  by  pole-shaping.  Moreover, 
more  realistic  boundary  conditions  should  be  chosen  for  plotting  the  magnetic  flux 
distributions. 

A computer-aided  method,  based  on  conformal  mapping,  was  developed  for 
determining  the  air  gap  flux  distribution  in  an  arbitrary  geometry  and  with  arbitrary 
excitation^  (see  comparison  report). 

E.  Design  Optimization 

The  design  of  an  electrical  machine  is  usually  performed  by  trial  and  error, 
starting  from  assumed  dimensions  and  ending  with  prediction  of  performance.  How- 
ever, when  dealing  with  a novel  application,  it  is  more  expeditious  to  start  directly 
with  the  design  specifications,  express  them  in  terms  of  electric  and  magnetic  stresses, 
establish  the  appropriate  geometry  and  finally  determine  the  dimensions.  This  ap- 
proach was  adopted  for  the  design  of  the  full  scale  prototype  and  for  its  scaled-down 

a i 10 
model. 

The  main  design  premises  were: 

(a)  iron  saturation,  rather  than  thermal  stress,  is  the  basic  limitation 
of  the  machine. 

(b)  supply  by  a current-source  inverter  under  closed  loop  operation  allows 
selection  of  the  angle  ij r between  the  direct  axis  and  the  armature  current 
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It  was  found  that,  for  unity  power  factor,  there  exists  an  optimurn  value  « 50°. 

This  choice  of  is  characterized  by  a significant  reduction  of  the  additive  effects  of 
the  armature  reaction  on  the  air  gap  flux  density  under  the  pole  faces.  In  turn  this 
reduces  saturation  of  the  armature  teeth,  the  most  stressed  part  of  the  iron.  In  addi- 
tion, at  the  same  time,  this  choice  of  maximizes  the  thrust  per  unit  gap  area  and, 
hence,  for  a specified  power  output,  tends  to  lead  to  a machine  of  minimum  size. 

F.  End  Effects 

One  of  the  advantages  of  linear  synchronous  machines  is  that  they  do  not  rely 
for  their  operation  on  induced  currents.  Therefore,  the  end  effects  which  seriously 
curtail  the  output  of  linear  induction  motors  can  be  alleviated.  Nevertheless,  it  was 
necessary  to  evaluate  the  undesirable  finite  length  effects  of  the  homopolar  inductor 
motor.  ^ It  was  found  that  the  permeance  fluctuations  cause  negligible  losses.  More 
important  is  the  possible  development  of  unbalanced  forces.  Undesirable  yaw,  pitch, 
and  roll  motions  may  be  minimized  by  making  the  overall  length  of  the  motor  equal 
to  an  integer  number  of  pole  pairs. 

G.  Laminations  of  the  Rail  Track 

The  question  of  whether  or  not  the  rail  track  should  be  laminated  and,  if  so, 
what  should  be  the  lamination  thickness  has  important  economic  implications.  The 
problem  does  not  easily  lend  itself  to  a rigorous  analysis.  It  was,  therefore,  investi- 
gated following  four  different  approaches.  ^ From  these  studies,  it  appears  that  eddy 
currents,  if  allowed  to  flow  unhindered  in  the  rail,  effectively  prevent  the  establishment 
of  a flux  over  the  leading  portion  of  the  motor.  Hence,  the  use  of  a solid  rail  is  not 
feasible  and  laminations  are  necessary.  An  approximate  analysis,  which  accounts  for 
saturation  effects,  leads  to  the  conclusion  that  the  laminations  need  not  be  wider  than 
6 mm.  This  thickness  is  believed  to  be  structurally  adequate,  economically  feasible 
and  leads  to  negligible  losses.  A rigorous  analysis,  based  on  the  periodic  structure 
approach  is  in  progress. 

H.  Conclusions 

An  investigation  of  various  types  of  iron-cored,  synchronously  operating  linear 

motors  has  shown  that  for  clearances  in  excess  of  1 cm.,  the  homopolar  inductor  type 

is  the  most  promising  linear  motor.  This  conclusion,  as  well  as  the  accuracy  of  the 

analysis,  has  been  confirmed  by  a comparative  evaluation  of  the  linear  inductor  and 

1 2 

induction  motors  carried  out  by  General  Electric  on  112-kW  models.  Further 

1 3 

confirmation  is  provided  by  models  constructed  at  the  University  of  Kentucky, 

14  15 

University  of  Toronto,  and  the  Technical  University  of  Munich. 
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COMPUTER-AIDED  CONFORMAL  MAPPING  OF  MAGNETIC  FLUXES  IN  SATURATED 
INDUCTOR  MOTORS 

E.  Levi,  J.P.  Lee,  F.  Lalezari  anti  M.  Gemelos 
A.  Introduction 

A simple  approach,  developed  ior  the  design  o£  linear  motors,  leads  to  a quick 
evaluation  of  the  machine  reactances  and  of  their  dependence  on  iron  saturation.  The 
method  consists  of  conformal  mapping  by  means  of  the  Schwarz- Christoffell  transforma- 
tion coupled  with  numerical  integration.  This  allows  the  handling  not  only  of  a large 
number  of  pole  shapes,  but  also  of  variable  potential  distributions  at  the  boundaries. 


B.  Salient- Pole  Shaping 


An  inductor  motor  for  linear  electric  propulsion  is  sketched  in  Figure  1.  Both 
the  field  and  armature  windings  are  located  in  the  same  structure,  while  the  rail  track 


Fig.  1.  Sketch  of  homopolar  inductor  motor, 
is  passive.  Since  the  machine  relies  for  its  operation  on  variable  reluctance  effects, 
the  primary  design  choice  is  a pole  profile  which  accentuates  the  variation  in  reluctance. 
This  effect  is  most  simply  studied  using  field  excitation  only.  A two-step  Schwarz- 
Christoffell  transformation  reduces  the  idealized  geometry  of  Fig.  2(a)  to  the  parallel 
plate  configuration  of  Figure  2(b).  Plots,  such  as  shown  in  Fig.  3,  giving  the  harmonic 


Fig.  2.  Salient  pole  profile. 

(a)  Original  (b)  Transformed. 
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Fig.  3,  Average  value  and  first  3 harmonic  amplitudes  vs. 
pole  face  length  (H  = lOg). 


coefficients  C^  (amplitude  of  the  flux  density  harmonic  content  normalized  to  the  peak 
field)  guide  the  choice  of  pole  shapes  and  dimensions.  For  instance,  Bq  determines  the 
cross  section  of  the  field  yoke  and  of  the  passive  rail  track  and,  hence,  affects  the 
weight  and  cost  of  the  propulsion  system.  In  contrast,  for  a given  armature  area,  sur- 
face current  density,  and  power  factor,  B^  is  proportional  to  the  machine  output.  Hence 
Cif/C0f  is  an  index  of  the  output/weight  ratio  and  a figure  of  merit  for  the  system  de- 
sign. Figure  3 shows  that  the  pole  face  length  f should  be  approximately  half  of  the 
pole  pitch  t.  Similar  plots  indicate  that  little  is  gained  by  making  the  depth  of  the  inter- 
polar  space  H more  than  6 times  the  gap  length  g. 

C.  Determination  of  the  Unsaturated  Reactances 


The  mutual  reactance  X^  between  field  and  armature  is  given  by  times  the 
magnetizing  reactance  Xm  for  a uniform  gap  of  length  g.  Determination  of  the  armature 
reactance  Xacj  and  X is  slightly  more  complicated.  The  Schwarz- Christoffell  trans- 
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formation  distorts  the  armature  boundary.  Consequently  the  magnetostatic  potential 
distribution  along  the  armature  is  deformed  and  must  be  calculated,  before  it  is  applied 
to  the  parallel  plate  configuration.  However  the  field  distribution  can  be  determined 
analytically  and  is  easily  transformed  back  into  the  original  geometry.  The  resultant 
reactance  coefficients  are  shown  in  Figure  4.  For  the  sake  of  comparison,  Figs.  3 and 
4 also  show  the  values  of  the  reactance  coefficients  obtained  by  the  conventional  1 -di- 
mensional (straight- Line)  approximation . 


Fig.  4.  Direct  and  quadrature  coefficients  vs. 
pole  face  length  (H  = lOg). 

D.  Correction  for  Iron  Saturation 

The  same  technique  is  applied  to  account  for  saturation  effects.  The  magnetic 
flux  in  the  air  region  resulting  from  the  combined  field  and  armature  excitations  is  sub- 
divided into  tubes  of  equal  flux  for  which  the  computer  calculates  the  coordinates  along 
the  iron.  The  corresponding  m.m.f.  drops  are  evaluated  with  the  help  of  the  iron 
magnetization  curve.  The  deviations  in  magnetostatic  potential  resulting  from  these 
drops  are  then  treated  as  additional  excitations.  Superposition  applies,  since  the  air 
region  is  linear.  A first  order  correction  is  thus  introduced  in  the  flux  distribution. 

The  process  can  be  iterated,  if  more  accuracy  is  needed. 
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E.  Comparison  with  Experiment 

A homopolar  inductor  linear  motor  with  nominal  output  of  112kW  was  built  by 
General  Electric  and  tested  on  a 1 . 35  m diameter  wheel.^  The  60  Hz  open-circuit  satura- 
tion curve  data  obtained  during  one  of  these  tests  is  shown  as  the  solid  curve  in  Figure 
5.  Predictions  based  on  the  theory  described  earlier  are  shown  for  comparison  as  3 
calculated  points  in  the  figure.  Excellent  agreement  is  apparent. 


Fig.  5.  Open  circuit  saturation  characteristics. 
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MODAL  REPRESENTATION  OF  E-M  FIELDS  IN  LAMINATED  MOVING  CONDUCTIVE 
FERROMAGNETIC  MEDIA 

B.R.  Cheo,  E.  Levi  and  K.C.  Chang 

A.  Introduction 

Early  in  the  development  of  electrical  machines,  some  150  years  ago,  it  was  real- 
ized that  the  portions  of  magnetic  flux  path  carrying  an  ac  flux  need  to  be  laminated  in 
order  to  reduce  the  effects  of  eddy  currents.  Yet  the  accurate  evaluation  of  the  effects 
is  still  an  unresolved  problem.  The  difficulty  stems  from  two  sources:  geometrical 
complexity  and  iron  saturation.  Most  of  the  engineering  purposes  have  been  served  by 
approximations  which  are  valid  in  the  case  of  thin  laminations.*  Technological  prog- 
ress in  materials  and  manufacturing  have  made  the  use  of  such  thin  laminations  econom- 
ically feasible . 

However,  in  applications  of  the  recently  developed  interests  in  magnetic  propul- 
sion of  vehicles,  the  track  rail  itself  has  become  part  of  the  magnetic  circuit.  Econom- 
ic and  structural  considerations  have  made  it  essential  to  use  the  coarest  possible 
laminations  . 

This  report  addresses  itself  to  the  first  step  of  the  classical  problem  of  thick 
laminations:  the  modal  structure  of  EM  fields  in  a laminated  moving  conductive  ferro- 
magnetic medium  and  the  boundary  value  problem  associated  with  the  excitation  in  an 
air  gap. 

B.  Formulation  and  Coordinate  System 

We  consider,  according  to  Fig.  1,  a half  space  of  laminated  structure  occupying 
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the  region  z > 0.  The  medium  moves  with  a constant  velocity  v = T v.  The  static  sta- 
tionary sources  are  confined  in  the  region  z < 0.  For  simplicity  we  assume  that  they 
are  uniform  in  the  y direction,  normal  to  the  lamination  sheets. 

The  analysis  is  carried  out  in  a coordinate  system  fixed  to  the  sources.  Since 
the  structure  seen  by  the  sources  is  unvarying,  time  dependence  is  eliminated  from 
the  problem. 

C.  Fields  in  the  Air  Gap 

We  assume  that  in  the  absence  of  the  lamination  structure,  the  sources  produce 
a known  source  field  Hs(x,z)  that  is  expressible  as  a Fourier  integral: 
so 

Hs(x,z ) = / F(z,  p)  exp(jpx)  dp  mi 

- so 


where 


— 1 — 

F(z,  P)  = ~T~  f H (x,  z)  exp(-jpx)  dx 


(2) 


In  the  presence  of  the  moving  lamination  structure,  an  additional  set  of  electro- 
magnetic fields  will  be  induced  within  the  lamination  and  in  the  air  so  that  the  boundary 
conditions  at  the  interface  z = 0 are  satisfied.  We  first  establish  the  modal  form  of  the 
induced  fields  due  to  a source  field  F(z,  p)  exp(jpx).  Both  E and  H fields  will,  in  gener- 
al, be  present  and  each  can  be  expressed  as  the  gradient  of  a scalar  potential.  They 
all  have  exp(jpx)  dependence  and  the  true  fields  are  obtained  through  the  Fourier  trans- 
form in  the  form  of  Equation  (1). 

Let  the  total  electromagnetic  field  in  the  air  region  be  expressed  as; 


Hg  = F(z,  P)  exp(jpx)  - V 'J' 


* = Z bm  exp(j2rmry/d)  sinh[  o (z  + g)]exp(jpx) 


(3) 


m=-  <*> 


E = -V$ 
g 


$ = Z am  exp(j2rmry/d)  sinh[  £ (z  + g)]exp(jpx) 


(4) 


m = -» 


where 


+ ( 


2mir 

d 


v2$ 


0,  V <b  = 0 
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The  hyperbolic  sine  sinh[  am(z  + g)]  dependences  in  the  magnetic  potential  it  and 

the  electric  potential  4>  are  chosen  according  to  the  symmetry  of  the  geometry.  This 

choice  implies  the  assumption  that  z = -g  is  an  equi- potential  surface  for  both,  and  is 

taken  to  be  the  reference.  The  coefficients  a and  b are  to  be  determined. 

m m 

D . Modes  in  the  Laminations 

In  the  iron,  we  seek  a set  of  modes  which  are  solutions  to  the  Galilean  transform- 
ed Maxwell's  equations  and  satisfy  the  boundary  conditions  at  the  edge  of  laminations 
y =0,  id,  + 2d,  • • • • . The  solution  will  be  periodic  in  the  y-direction  with  periodicity 
d. 


The  Galilean  transformed  Maxwell's  equations  considered  are: 
VxE  = 0 

VxH=J  = o(E+pVxH) 

V • H = 0 


(5) 

(6) 
(7) 


where  v = a^v,  cr  and  p are  the  velocity  conductivity  and  permeability  of  the  moving 
medium.  The  fields  E,  H here  are  those  seen  by  the  stationary  observer,  i.e.,  the 
observer  at  rest  with  respect  to  the  source  currents.  The  laminations  are  separated 
by  infinitesimally  thin  perfect  insulations.  The  boundary  conditions  are: 

(1)  All  components  of  H are  continuous. 

(2)  Normal  component  of  J = 0 (J^  = 0)  at  the  edge  of  the  laminations 


(y  = 0,  ±d,  i 2d, 


). 


(3)  A finite  voltage  may  exist  across  the  insulation. 


All  fields  are  assumed  to  have  exp(j(3x)  behavior  as  stated  previously. 

It  can  be  shown  that  there  are  two  types  of  modes  which  exist  in  the  medium  satis- 
fying the  above:  The  first  type  is  the  TM  modes  which  consist  of  both  E and  H fields. 
The  second  type,  the  TE  modes,  consists  of  the  H field  only. 

It  can  be  shown  that  a complete  representation  of  the  total  fields  is  given  by  the 
following  equations. 


jpE 


E = — — ^ sinh[y(y  - y)]exp(-ot)  + \ E ( ) cos( )exp(- a 


Li  n'  n» 
n = 1 


z) 


(8) 


E„  = E^{cosh  [\(y  - )1  - sinh(-^p  ) 6(y  - d)}  exp(-a  z) 


y o 

30 


+ ZEn[sm(^) 

n=  1 


6(y  - d)  ] exp(-  a ^z) 


2d 

n 7T  n 


(9) 
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-aE  oo  a d 

Ez  = — ^ sinh[v(y-  |)]exp(-<rz)  + £ En( ) cos  ^ exp(- o^z) 

n = 1 

Hx=^f  Eo  cosh[y(y-  -|)]exp(-o2) 


+ n?i[  <jP  ~ MV)  cn  H"  ‘ V(£)2eJ  sin(^)exp(-anz) 


(10) 


(11) 


Hy  = l.  Hn£n  + 1 cos^exp(-V) 
n = l 


(12) 


Hz  = Eq  cosh  tv(y  - |)]  exp(-az) 
V 

so  & d 


+ nSl-  7T£n  + lHn*  jP<r(^T)2En]  sln^expf-^z) 


(13) 


where: 


E : modal  amplitudes  of  the  TM  modes 
n r 

H^:  modal  amplitudes  of  the  TE  modes 
c.  = 1 for  j = odd  and  e.  = 0 for  j = even 

« = IP  I 

V = (jPRv)1/2 

«n  = [ P2  + (^f)2  + jpff  Mv]  1//2 

The  e(jpx)  dependence  is  understood  throughout.  The  expressions  above  are  valid 
for  the  unit  cell:  0+<  y < d +,  i.e.,  for  the  region  including  the  first  lamination  and 
the  first  insulation  sheet  to  the  right  of  x-z  plane.  The  fields  extend  from  y = -oo  to 
y =°°  periodically  from  cell  to  cell  with  periodicity  d.  The  6-function  gives  rise  to  a 
finite  voltage  between  the  laminations  and  is  needed  for  (f)  E • di  =0  everywhere. 

The  terms  corresponding  to  L ^ in  Eqs.  (8)  to  (13)  are  those  of  the  dominant  mode. 

The  identity  at  - |p  | indicates  that  this  mode  has  the  least  attenuation  in  the  z-direction 

and  thus  corresponds  to  the  deepest  penetration  into  the  iron  from  the  air  gap.  It  can 

also  be  shown  that  J =0  everywhere  for  this  mode.  The  other  terms  (E  and  H ) cor- 
Y n n 

respond  to  the  higher  order  modes.  They  all  attenuate  very  rapidly  with  increasing 

distance  from  the  air  gap.  In  the  limit  of  d — » 0,  only  the  dominant  mode  survives  and 

only  Ey,  and  remain.  All  other  components  vanish  as  well  as  the  current.  The 

medium  thus  behaves  as  if  <r  = 0. 


» 


I 
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E.  The  Boundary  Value  Problem 

After  establishing  the  modes  in  the  air  gap  and  in  the  iron  laminations,  we  out- 
line the  procedure  for  solving  the  modal  amplitudes  a , bm»  E^  and  The  proce- 

dure is  as  in  the  standard  scattering  problem  of  a periodic  grating;  i.e.,  by  matching 
the  total  fields  in  the  air  and  in  the  iron  at  the  interface  z =0.  The  following  boundary 
conditions  serve  as  the  basis: 

(1)  The  tangential  components  of  E and  H (x  and  y components)  must  be 
continuous . 

(2)  The  normal  component  (z-component)  of  B must  be  continuous. 

(3)  The  normal  component  of  the  current  density  J must  vanish. 

We  note  that  in  a wave  scattering  problem,  condition  (1)  will  be  sufficient  to  de- 
termine the  modal  amplitudes  uniquely.  Since  we  are  dealing  with  the  time  invariant 
Galilean  transformed  systems,  the  normal  component  continuity  also  needs  to  be  im- 
posed. The  three  conditions  above  are  not  independent  of  each  other.  Using  the  first 
two,  the  third  will  be  satisfied  automatically.  However,  it  is  often  judicious  to  use  all 
three  in  combinations.  A great  deal  of  simplification  of  the  algebra  may  result. 

To  date  we  have  looked  into  the  problem  of  the  fields  excited  by  a pair  of  line 
current  1 Iq  located  at  z = -g,  x = 0 and  x = L respectively.  The  current  at  x = 0 flows  in 
the  +y  direction  as  shown  in  Figure  1 . We  consider  first  the  case  for  one  line  current 
at  x = 0 . Then 

F(z,  (3)  = ( ax-  j sgn  (3  aj  Iq  exp  [-  | (3  | (z  + g)]  (14) 

where 


sgn  (3 


1 for  p > 0 
- 1 for  p < 0 


To  establish  conditions  (1)  to  (3),  we  first  recall  that  the  expressions  (8)  to  (13) 
are  descriptions  of  the  fields  in  the  first  unit  cell.  At  z=0,  these  expressions  can  be 
expressed  as  a Fourier  series  in  y (periodicity  d)  of  the  form  E f^  exp(j  2nn  y/d ) . By 
matching  the  coefficients  with  those  given  by  Eqs.  (3)  and  (4),  we  obtain  several  sets 
of  infinite- by- infinite  algebraic  equations  for  the  unknowns  (am>  ^m)  ar>d  (E^.H^).  After 
much  algebra,  the  following  results  are  obtained: 


E = - 
n 


y*v 3 exp('2 Ip  I s) 


8cr [ onCosh(  |p  | g)  + pr  |p  | sin  h(  |p  | g)](n2:r2  + y^d2) 


(15) 


Rr  3 


1 
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2 ,2  , . 2.  E 

_ yd(  y d + 4ir  ) y n 

° sinh(yd/2)ir  3 n=odd  n(n2  - 4) 


E = 


(16) 


1 ^ Q'dr  . 2 2 « 

3 = — -T-T^ — 7 I exp(-  |p  I g)  + 7 2-r-  I 1 - — (1  - -^-y)(-!y )]  E 

o jpSlnh(aog)j  ° n=odd  (an)3  L % vV  a2-  4 J n 


H = 0,  E =0  for  a even,  a = 0,  b =0  for  m > 0 
n n m m 

Where  e satisfies 
6n 


(17) 

(18) 


z. 


4m2) 


, m = 1, 2,  3, 
m 


(19) 


n=odd  (n 

we  notice  the  following  significant  results: 

(1)  From  Eq.  (18)  we  see  that  there  are  no  TE  modes  in  the  iron. 

(2)  Only  odd  TE  modes  exist. 

(3)  There  is  no  electric  field  in  the  air  gap. 

(4)  The  magnetic  field  in  the  air  gap  is  smooth  in  y. 

Finally,  from  the  computational  point  of  view,  the  most  significant  result  is  given 

by  Eqs.  (15)  and  (19);  Eq.  (15)  shows  that  the  modal  amplitudes  E^  are  equal  to  a known 

function  containing  all  the  parameters  (a,  g,  <r,  v,  ji  and  p)  multiplied  by  g^;  and  Eq.  (19) 

shows  that  is  a function  of  n only.  Therefore,  the  only  numerical  work  here  is  the 

solution  of  the  infinite  set  od  Eqs.  (19)  once.  The  modal  amplitudes  for  all  cases  are 

then  determined.  We  have  solved  Eq  . (19)  numerically,  and  found  the  values  of  g^  for 

all  n < 203.  It  is  of  great  interest  to  note  that,  within  the  numerical  accuracy  of  the 

computation,  g^  behaves  as  cn  2 with  c = -3.24.  This  remarkable  behavior  is  shown 

in  Fig.  2 where  g is  plotted  over  four  decades  on  a log-log  graph,  given  along  with 
- 2 n 

the  n straight  line  for  comparison. 
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COMPARISON  OF  SEQUENTIAL  PARTITION  DETECTOR  (SPD)  WITH 
OTHER  NONPARAMETRIC  SEQUENTIAL  DETECTORS 

R.  F.  Dwyer  and  L.  Kurz 

In  this  report,  the  SPD  will  be  compared  with  two  sequential  nonparametric  de- 
tection techniques  developed  by  Chadwick  and  Kurz.  * They  considered,  essentially, 
a robust  sequential  sign  test  and  its  dual  counterpart.  The  dual  test  uses  two  distinct 
nonparametric  tests  on  the  same  received  data  sample.  The  transmitted  wave  form 
consists  of  either  a signal,  sq,  of  level  uq,  or  a signal,  s^,  of  level  u^.  The  problem 
is  to  decide  which  signal  was  transmitted  in  the  presence  of  additive  noise.  The  dual 
formulation  uses  one  of  its  tests  to  sequentially  evaluate  if  sc  was  transmitted,  while 
the  other  test  is  processing  the  received  data  for  the  presence  of  s ^ . If  both  tests 
agree,  (i.e.  , sq  is  accepted  and  s^  rejected,  or  vice  versa)  to  which  signal  was  trans- 
mitted, a decision  is  made.  However,  if  they  are  in  disagreement,  another  sample  is 
taken. 

The  classical  sign  test  employs  the  statistic, 


n 


i 


where 


z. 

1 


1 if  x.  > 0 
i — 


0 if  x.  < 0 

i 


and  x.(i=  1,2 n)  are  i.i.d.r.v.  of  the  received  data.  The  thresholds  we  re  de  - 

rived  from  the  Chebyshev  and  Che  rnoff  bounds , respectively. 


2 


1 (— 

2 \1+  an  / 


1/2 


<1  + 1 

2 + 2 


1 + 


i_) 

an  / 


1/2 


(1) 


and 


i.i('-«2/n)1/2<v„<M('-«2/")'/2  m 

A simulation  was  conducted  in  order  to  compare  the  results  of  Ref.  1 for  the  se- 

9 

quential  sign  test  with  the  SPD.  The  SNR  was  defined  as  A./ o , where  A.  represents 

2 1 1 
the  shift  in  signal  level  and  o is  the  noise  variance  of  N(0,  a).  Table  I represents  the 

results  of  40  trials  for  two  cases,  A^  = . 92  and  A^  = . 46  and  the  average  sample  size 

(ASN)  is  the  average  of  the  outcomes.  Table  II  gives  the  results  from  Chadwick  and 

Kurz1  for  the  same  a and  SNR  used  in  Table  I.  Two  points  should  be  mentioned: 
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(1)  The  SPD  has  the  advantage  of  being  able  to  specify  a,  3,  and  adjusting  ASN  by 
varying  Aj  in  advance,  whereas  it  is  not  usually  possible  to  do  this  for  the  sequential 
sign  test;  and  (2)  by  increasing  m,  the  SPD  represents  a generalization  of  the  sequential 
sign  test. 

TABLE  I.  ASN  for  the  SPD  under  a shift 
of  the  mean  alternative. 


m = 2 

I 

m = 4 

m = 2 

m = 4 

A1 

. 46 

. 46 

.92 

. 92 

A 

. 92 

. 92 

. 92 

. 92 

0(A) 

. 0005 

. 0005 

. 004 

. 004 

ASN 

24.  6 

19.  95 

1 

13.95 

13.875 

a = .01;  SN  R = 0 dB 


TABLE  II.  ASN  for  the  sign  test. 


1 CHEBYSHEV 
THRESHOLD 

CHERNOFF 

THRESHOLD 

3(A)  .004 

. 004 

ASN  44.  5 

i 

9.  5 

a = .01;  SNR  = OdB 


A closer  look  at  the  SPD  for  m=  2 reveals  an  interesting  similarity.  Recall  from 
Ref.  2 that  the  test  statistic  for  the  SPD  is  given  by: 
n 2 

b < T = 7 7 b.  n.,  < a (3) 

n u U k lk 

i k 

n n 

b < bj(n  - £ ni2^  + b2  £ ni2  < a 
i i 


or 
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since , 


n = y n..  + V n.. 

La  il  Li  \t 


Therefore, 


b bl  1 y a bl 

n(b2  - bj)  b2'bi  n Y ni2  n(b2  *bj)  (b2  - bL) 

if  b^  = - b j and  b ^ = - | b ^ | , then 


n 

b .1  . 1 v ^ a .1  , , \ 

n(b,  + b ) 2 n La  ni 2 n(b,  + b ) 2 

Notice  that  Eq.  (4)  has  the  same  form  as  Equations  (1)  and  (2).  If  n -»  <»,  all  3 

test  thresholds  approach  one -half.  However,  for  n small,  the  thresholds  are  quite 
different.  In  Eq . (4),  the  thresholds  depend  upon  a,  g and  the  scores  which  in  turn 
are  derived  from  the  likelihood  ratio.  Therefore,  all  the  information  known  about  the 

sequential  test  is  given  in  the  thresholds.  This  is  not  the  case,  however,  in  Equations 

(1)  and  (2).  Also,  since  a and  b are  derived  from  a and  g,  no  initial  sample  size  must 
be  specified  for  the  SPD.  On  the  other  hand,  in  the  sequential  sign  test,  a minimum 
sample  size  must  be  given  to  insure  a given  a.  Table  III  compares  the  thresholds  as 
a function  of  n for  the  Chebyshev,  Chemoff  and  SPD  bounds.  Note  that  for  n small, 
the  SPD  has  larger  thresholds  which  insure  the  specified  error  probabilities. 


TABLE  III.  Thresholds  for  the  sign  test  using  Chebyshev, 
Chemoff  and  SPD  bounds  and  a = . 0067. 


n 

CHEBYSHEV 

CHERNOFF 

SPD 

i 

1.66  x 10‘3;  .998 

1 . 12  x 10  ; . 1 

wssm 

2 

33  x 10'3;  . 996 

1.67  x 10*3;  .998 

1 

-2;  3 

5 

8 x 10"3;  . 99 

.035;  .965 

-.5;  1.5 

10 

.016;  .98 

. 102;  . 898 

0;  1 

1 

Lower;  Upper 

Lower;  Upper 

Lower;  Upper 
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A.  The  Dual  Test 

The  ASN  can  be  reduced  further  by  using  the  dual  test  formulation.  Let  one  of 
two  signals  (sq  and  Sj)  be  transmitted  as  before.  However,  the  hypothesis  now  will  be 
the  noise  only  condition.  The  presence  of  s^  and  s^  will  be  tested  separately  against 
the  hypothesis.  This  requires  two  tests  as  discussed  above . Figure  1 represents  the 
testing  procedure  for  the  dual  test. 

IF  BOTH  Hot  AND 
Ht0  ARE  ACCEPTED 
S0  WAS  TRANSMITTED 


IF  BOTH,,  AND  H00 
ARE  ACCEPTED  S, 
WAS  TRANSMITTED 

Fig.  1.  Dual  test  sequential  testing  procedure. 

Let, 

H00:  ’ H01:  = F(x  + SD) 

and 


Hl0:  F(x)  .,  Hn:  G(x)  = F(x  -Sj) 

Then,  if  both  HQ1  and  H1()  are  accepted,  sq  was  transmitted,  or  if  both  H ^ and 
HQOare  accepted,  Sj  was  transmitted. 

Since  the  tests  are  symmetrical,  only  the  results  obtained  for  sequentially  de- 
tecting the  presence  of  sq  will  be  given. 

Consider  the  joint  probability  of  accepting  s and  at  the  same  time,  rejecting  the 
possibility  of  s^  being  present  when  Sj  was  indeed  transmitted 

p<Loi  V*.  • T„ii  W i5' 

■ P(Tnoi  V‘|)  p(Tn,  < Vl1 
’ P(Tno  * V»1>P<T„1  * VSI> 

= “o  8I  = aD 

and 


P(T  < T,  /s  ; T , T /s  ) = 6 a , = -B~ 
no  - V o nl  - a'  o'  Fo  1 PD 


(6) 


'I 
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Where  Ta>  indicate  upper  and  lower  thresholds,  a Q = false  alarm  probability 
for  dual  test,  = false  dismissal  probability  for  dual  test,  and  a ( , a , g , 8 are 
the  individual  false  alarm  and  false  dismissal  probabilities,  respectively.  The  tests 
are  considered  independent  and  they  terminate  on  the  boundaries. 

The  assumption  that  both  tests  terminate  on  the  boundaries  needs  some  explana- 
tion. It  is  reasonable  to  assume  that,  usually,  both  tests  will  not  reach  their  bounda- 
ries simultaneously.  Therefore,  one  test  will  reach  its  boundaries  first,  and  will 
continue  until  the  other  test  has  crossed  its  boundaries.  A convenient  modification 
would  be  to  terminate  the  first  test  that  crosses  its  boundaries  and  only  continue  with 
the  second.  This  procedure  would  insure  termination  on  the  boundaries  except  for  a 
small  excess  which  can  be  neglected.  However,  from  Ref.  2,  if  this  modification  was 
not  made,  the  error  probabilities  of  the  first  test  could  only  be  reduced.  Therefore, 
the  assumption  of  both  tests  terminating  on  the  boundaries  will  only  set  an  upper  limit 
on  the  error  probabilities  a,  g. 

The  overall  probability  of  false  alarm,  a D>  and  false  dismissal,  g^,  are  a 
product  of  the  individual  a and  3 • Therefore,  given  the  desired  and  gQ  for  the 
dual  test,  the  individual  tests  a and  g can  be  reduced  proportionally,  which  is  equiva- 
lent to  lowering  the  thresholds. 

Example 

Given  a j_j  = .01  and  g^  = .004,  then  the  individual  a and  g become 

aD  = a0  P1  = -01  ; ^0  = ai  e0  = -004 

Let 

ao  = P,  and  = P0 

Therefore, 

ao  = 01  =‘1;  ftl  = P0  = -0632 

Figure  2 gives  the  calculated  results  of  using  the  above  a , 8q  for  m = 2,  4. 

A simulation  was  conducted  using  the  SPD  in  a dual  test  formulation  in  order  to 
compare  the  predicted  performance  given  in  Figure  2.  Table  IV  compares  the  results 
for  40  trials  of  a simulation  with  the  predicted  performance  given  in  Figure  2.  Note 
that  the  ASN  obtained  in  the  simulation  corresponds  very  well  with  the  ASN  predicted 
in  Fig.  2 for  A = .92. 


A 
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a*  0 i 

B ' 0 0632 

A/0  46 

/ 

PROBABILITY  of  detection 

1 L_ 

/5  0 500  0625 


Fig  2 ASN  shift  alternative. 


TABLL  IV  ASN  tor  the  SFD  under  a dual 
test  formulation. 


SPD 

Predicted 

m * 2 13.05 


Simulation 

15.7 


m = 4 


A * . 92 
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A COMPARISON  OF  PARAMETRIC  AND  PARTITION  SEQUENTIAL 
IN  RADAR  AND  SONAR  PROBLEMS 

R.  F.  Dwyer  and  L.  Kurz 


DETECTORS 


An  important  problem  in  radar  and  sonar  is  the  detection  of  a sinusoidal  signal 
in  no.se  Bussgang  and  Middleton1  obtained  the  optimum  sequential  test  for  the  enve- 
lope of  narrow-band  no.se  and  an  additive  sine  wave.  Blasbalg2  obtained  the  optimum 
'sl.ung  threshold"  for  a Bernoulli  sequential  detector  operating  under  a sine  wave 
> arr.er  in  zero  mean  Gaussian  noise  as  given  in  Reference  1.  His  results  for  the 
imall  SNR  case  were  obtained  by  expanding  the  distribution  function  in  a power  series 
and  keeping  only  second  order  terms.  The  optimum  threshold  was  found  from  the 
equation  p(r,  0)  205,  where  p(r,  0)  is  the  distribution  function  of  the  envelope  under 

noise  only^cond, lions  and  r ,s  the  threshold.  The  efficiency  for  the  optimum  threshold 
was  found  to  be  b5  percent.  In  order  to  obtain  this  optimum  threshold  and  corres- 
ponding efficiency,  Blasbalg  assumed  knowledge  of  both  the  noise  and  signal  plus  noise 
di  st  r:  bull  ons  and  their  small  signal  expansion. 

.■i  a rei  cut  paper,  1 the  authors  introduced  the  theory  of  sequential  partition  de- 
tet  tJrs  <SF>D)  por  tple  Lehmann  alternative,  which  is  a good  model  for  radar  and 
-onar  problems,  with  m=  2,  the  optimum  quantile  yielded  the  efficiency  of  65  percent. 
Hus  result  applies  to  all  distributions  under  the  Lehmann  alternative  satisfying  the 

SNK  onditions  . However,  as  demonstrated  in  Ref.  3,  the  small  SNR  condition 
***  ,lso  b*'  rt'Lixed  while  maintaining  good  performance  estimates.  Also,  for  higher 
m,  tin  SPD  represents  a generalization  of  the  Bernoulli  sequential  detector. 


Now,  . ons.der  the  problem  of  constructing  a parametric  sequential  test  for  the 
incoherent  detection  of  a sine  wave  in  Gaussian,  N(0,  1),  noise.  This  is  the  case  dis- 
1 ussed  in  Rt’1  4 except  that  the  output  envelope  will  be  squared  before  forming  the 
test  statist, c It  is  well  known4  that  the  quadratically  detected  output  has  a non-central 
hi  square  distribution  with  p.  d.  f. 


g(x) 


x+  a 


2c 


>olVT/ 


2. 

o ), 


x > 0 


(1) 


= 0,  x < 0 . 

Where  IQ(  ) is  the  modified  Bessel  function  of  first  kind  and  zero  order,  ct2  represents 
the  noise  variance;  x is  the  squared  output  of  a narrowband  filter,  and  aQ  is  the  ampli- 
tude of  the  sinusoidal  signal.  For  aQ=  0,  Eq.  (1)  reduces  to  the  Rayleigh  d. stnbution 
with  p.  d f . 
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f(x)  = — Ur  e x/2°  , x>  0 (2) 

2o 

= 0,  x < 0 

2 2 

Also,  the  signal-to-noise  ratio,  SNR  is  defined  as,  SNR  = S,  = aQ/2  a • 


Then,  using  Wald's  loglikelihood  ratio,  the  sequential  test  statistic  is  expressed 
as : 


S 

n 


where 


X.  = - a 2/2  + ln[l  (a  /o2  ^/x  ) ] . 

i o'  o o'  VI  J 


The  thresholds  for  the  square-law  sequential  detector  are, 

b < S < a 
n 


(3) 


(4) 


and  the  test  terminates  with  acceptance  of  H or  H,  if  S < b or  S > a,  respectively. 

Notice  that  the  thresholds  (a,  b)  for  the  parametric  detector  are  identical  to  the  ones 
x 

used  in  the  SPD. 

For  small  SNR,  Eq.  (4)  reduces  to 

i 2,,  , 1 2 /0  2.  , 2,-2  , 2 ... 

Xi  = 'ao'2a  <l+Ia0^2°  ) + a0/2°  x/2ct  (5) 

and 

E(X/S)  = SS,  -i  S2  . 

Where  S and  S,  play  the  same  role  of  the  true  and  selected  alternatives  as  6 and  6, 
do  in  the  SPD. 

Since  S,,  S are  assumed  small,  the  value  of  t = tQ(S)  = 0 which  solves  the  moment 
generating  function,  0(t)  = 1,  can  be  given  in  terms  of  the  mean  and  variance  of  X , . 

This  follows  directly  from  the  results  in  Ref.  3. 

Then, 

E(X  ) 

to(S)  = - 2 lT  = l - 2 S/S, 

°X 

1 


(6) 
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Recall3  that  for  the  SPD,  tQ(6)  = 1 - 26/6  j, 

and 

m 

E<V6>  = <S61  - i ) S Ak2/p„k  • 

k 

If  the  relationship  between  6 and  S was  known,  an  efficiency  expression  could  be 
obtained  in  a simple  form,  comparing  the  optimum  parametric  sequential  test  with  the 
SPD.  To  that  end,  a technique  which  measures  the  separation  between  the  hypothesis 
and  alternative  will  be  considered. 

The  change  in  mean  is  often  used  as  a performance  measuring  parameter  in  para- 
metric tests . 

Let,  Hq:  F(x)  and  Hp  G(x)  . Then, 

00 

E(x/Hj)  = J xg(x)dx,  o2/H,  = e[(x-E(x/H))2J 

- 00 

The  change  in  the  mean  is  expressed  as, 

00 

Am  = J x(g (x)  - f(x))  dx  (7) 

-00 

Using  Eqs.  (1)  and  (2),  it  can  be  shown  that 

Am  a#  a 2 (8) 

o 

Therefore,  for  small  a^,  the  change  in  the  mean  is  given  by  the  square  of  the  sinus- 
oidal amplitude. 

For  the  SPD  under  a Lehmann  alternative,  i.e., 

H : F(x)  and  H , : G(x)  = F 1+6(x) 
o l 

The  change  in  the  mean,  using  Eq.  (2)  as  the  reference  p.d.f.  is  given  by, 

oo  oo 

Am  = J x(l  + 6)  F ^ (x)  f(x)  dx  - J x f(x)  dx 
o o 

Whe  re 

-x/2c2 


F(x)  = ( 1 - e 
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(9) 


(10) 

* 

From  Eqs . (8)  and  (10),  the  Lehmann  indexing  parameter  6 is  equal  to  the  SNR 
of  the  parametric  sequential  test,  i.e., 

£ 2/0  2 
6 = aQ/2o 

Therefore,  the  efficiency  expression  is  given  by, 


*VTLm>  = l Ak/Pok 


6 , a -*  0 
o 


(in 


where  S^  represents  the  parametric  sequential  detector  and  the  notat.on  of  Ref.  3 has 
been  used.  Typical  values  for  Eq.  (11)  can  be  obtained  from  Table  I. 

TABLE  I.  Efficiency  for  SPD  under  shift  and  Lehmann  alternatives 


m 

Shift  Alternative 
Gaussian  Noi se,  N(0,  1) 

Lehmann  Alternative 

2 

. 636 

.65 

4 

.88 

.89 

6 

. 94 

.95  | 

8 

. 965 

. 968 

10 

. 977 

— 

. 977 

Unfortunately,  Eq.  (1  1)  only  applies  for  small  6.  For  other  values,  Eq.  (9) 
must  be  solved  as  a function  of  6,  which  is  difficult  in  general.  Gotz  and  Kurz^  have 
evaluated  by  numerical  methods,  the  change  in  the  mean  and  also  the  change  in  variance 
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as  a function  of  the  Lehmann  alternative  using  a reference  c.d.f.  of  N(0,  1). 

A simulation  was  conducted  comparing  the  sequential  square -law  (optimum)  de- 
tector with  the  SPD.  The  input  consisted  of  additive  zero  mean  Gaussian  noise  and  a 
constant  amplitude  sinusoidal  signal  which  was  passed  through  a narrowband  filter  and 
squared.  The  quantiles  for  the  SPD  were  estimated  from  a training  sample  at  the  out- 
put of  the  square  law  device  for  m=  2,  4.  Figure  1 shows  the  results  for  20  trials  of 


Fig.  1.  ASN  - Optimum  vs.  SPD  (Lehmann  alternative)  - simulation. 

the  simulation.  Both  the  selected  Lehmann,  6 ^ , and  parametric , S^,  parameters  were 
set  to  one-half.  As  seen  from  Fig.  1,  the  ASN  for  the  SPD  approaches  the  optimum 
detector's  performance  very  rapidly  as  m increases. 
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RECURSIVE  EQUALIZATION  FOR  TIMING  JITTER  SUPPRESSION  IN  PARTIAL 
RESPONSE  SYSTEMS 

M.  Kavehrad  and  L.  Kurz 

In  this  report,  the  timing  jitter  suppression  problem  described  by  the  authors 
previously*  is  extended  by  introducing  a recursive  version  of  the  equalizer  at  the 
receiver.  It  is  shown  that  for  data  transmission  systems,  disturbed  by  noise,  inter- 
symbol interference  and  timing  filter,  which  use  partial  response  signaling,  the 
recursive  equalizer  yields  improved  performance  when  compared  with  the  corres- 
ponding nonrecursive  equalizer. 

A.  Design  of  Recursive  Equalizers 

Consider  the  data  transmission  system  as  shown  in  Figure  1.  The  corresponding 

2 

recursive  equalizer  structure  is  shown  in  Figure  2.  Following  Fitch  and  Kurz,  in 

NOISE 

n(t) 


s(t)=  lajS(t-iT) 


TRANSMITTER 


IDEAL 

CHANNEL 


RECEIVER  *!/ : *k  ponAM7FR 
FILTER  r(z) 


SAMPLER 


THRESHOLD 

DEVICE 


Fig.  1.  Data  transmission  system. 
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b, 
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Fig.  2.  Recursive  equalizer  structure. 

designing  the  equalizer  we  use  the  Wiener-Kolonogorov  theory  to  estimate  the  sequence 

[a,  } based  on  the  observation  of  the  sequence  {x  , j<k}.  The  integer  D>0  in  Fig.  2 
k J 

represents  a fixed  delay  in  the  estimation  of  the  source  symbols,  a^.  The  following 
steps  are  taken  to  obtain  the  parameters  of  the  equalizer.  Consider  the  z-transform 


r(z)  = [p2B(z)]_1  [e  (z)/B(z‘1]  + 


where  g^x(z)  is  the  cross-spectral  density  function  and  p and  B(z)  are  taken  from  the 
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canonical  factorization  of  spectral  density  function 

gxx<z)  = P2B(z)B(z_1)  (2) 

The  symbol  (+)  in  Eq.  (1)  indicates  that  only  the  non-negative  powers  of  z are  retained. 
The  in  Eq-  (2)  is  obtained  from 

N N 

g (z)  = V ( Y y'r.r.6..  + o^6^zs  (3) 

Bxx'  U \ u Lj  1 j s+j-1  ns/  ' ' 

s i=  -N  j=-N 
which  is  of  the  form 

gxx(z)=  d-2Nz'2N  + d-2N+lZ'2N+1+---+d-lz'1  + d0+dlz+---+d2Nz2N 


d2N-i  " S r-N+j  rN-i+j  + °n  62N-i 


It  should  be  noted  that  in  deriving  Eq.  (4)  it  is  assumed  that  only  intersymbol  inter- 
ferences which  occurs  within  the  N preceding  and  succeeding  symbols  of  the  symbol 
under  consideration  affect  it.  Similarly, 


/ \ V T V c Is  D / N N-l  -N  \ 

gyx(z)=lLL  ri  6s+i-E>  J Z =z  v-N Z +r-N+lZ  +"-+  r0+--*rNZ  ) 


s i=-N 


zN+D 

B(z_1)  C{Z  CN+EH-1 


while 


B(z)  = 1 +bj  z + - . - + b2N  z 


The  values  of  b's  and  c's  give  us  the  design  parameters  for  the  recursive  equalizer  of 
Figure  2. 

To  obtain  specific  comparison  results  with  the  equalizer  of  Ref.  1,  we  assume 
that  the  timing  jitter  immune  partial  response  signals  are  of  minimum  bandwidth 
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(twice  the  Nyquist  bandwidth)  and  are  of  the  Kretzmer  form.  As  was  shown  in  Ref.  1, 
the  overall  spectrum  for  timing -jitter-immune  partial  response  signals  is 

RM  = t(1--Lm)V  rke'j“kT  (9) 

k=-N 

Using  the  design  equations,  the  specific  results  for  the  taps  of  Kretzmer  signals  are: 
l-  ro=rl  = r-l=1 


1 


. 346 


c2  = . 168  c3  = . 14 


bj  = .514 


b2=  .345 


II. 


r„  = r,  = r - = 1 
o 2 -Z 


rl  = r-l  = 2 


c . = .172  c, =.243  c =.0,1  c.=.142 


bi  = bz=b3=.59 


III. 


r = 1 
o 


rl  = r-l  = 2 r2  ~ r-2  ~ 2 


cj  = -■  19 


c2  = .23  c3=.29  c4=.14  c5=.089 


bt  = .212 


b2  = -.22  b = - .21 


b4  = .095 


IV. 


r = 1 
o 


rl=r_l=°  r2=r.2 


c . = -.35  c,  = c = 0 c = - . 52  c , = - . 49 

1 z 4 j o 

bj  = b3  = 0 b2  = . 514  b4  = . 35 


V. 


rl  = r-l  = r3+  r-3=  ° 


r2  = r-2=  2 


r = r = r = - 1 
o 4 -4 


C 1 


C4=C6=C8S°  c 3 = " • 2 1 c5=--26 


c?  = - .061 


c9  = .21 


bj  = b3  = b5  = b?  = 0 b2  = - . 325 


b4=3.48  b6=-1.69  bg  = . 172 


There  are  two  basic  advantages  of  recursive  equalization  described  above:  the  equalizer 
may  be  easily  operated  in  an  adaptive  mode  by  adjusting  the  taps  based  on  the  estima- 
tion of  channel  conditions  it  improves  equalization  is  some  cases  up  to  50%,  basing 
the  comparison  on  the  signal-to-noise  ratio  degradation.  The  specific  numerical 
comparisons  for  the  Kretzmer  signals  are  given  in  Table  I. 
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D^  - unit  delay  operator. 
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ROBUST  SCORE  ESTIMATION  FOR  SIMPLE  UNEAR  RANK  DETECTORS 
I.  M.  Habib  and  L.  Kurz 

Nonparametric  methods,  which  have  found  broad  areas  of  applications  in  detec- 
tion and  estimation,  * are  based  on  the  branch  of  statistical  inference  for  which  the 
parameter  space  associated  with  the  underlying  distributions  cannot  be  represented  by 
a finite  set  of  real  numbers.  A commonly  used  property  of  nonparametric  tests  is  that 
the  level  of  the  test  is  fixed  irrespective  of  the  distribution  of  the  observables.  For  a 
given  level,  the  rank  tests  represent  a class  of  powerful  tests.  To  preserve  the  power 
of  the  rank  tests,  the  knowledge  of  scores  is  essential.  If  the  scores  of  the  rank  tests 
are  selected  beforehand,  based  on  some  general  considerations  irrespective  of  the 
underlying  distribution  under  the  alternative,  the  test  may  have  low  power,  especially 

if  the  selected  scores  are  sufficiently  different  from  those  generated  by  the  true  distri- 
2 

bution.  It  is  the  purpose  of  this  report  to  develop  a robust  (insensitive  to  changes  in 

underlying  distribution)  procedure  which  generates  proper  scores  based  on  the  sample 

3 

of  observables  for  simple  rank  tests.  A similar  philosophy  was  used  elsewhere  by 
the  authors  to  develop  score  estimators  for  generalized  quantile  detectors. 

A.  Score  Estimation 

Let  the  observables  form  a sample  (vector)  x of  size  N and  f (x)  denote  the  value 

/•  % 1 

of  the  i-th  smallest  coordinate  in  x.  Denoting  x'  ' - f^x),  we  have 


x<l><  x<2> 


< X 


(N) 


U) 


where  x^  is  the  i-th  order  statistic  of  x.  Let  r^(x)  denote  the  number  of  x's  < x^ , i.e  , 
the  rank  of  x.  in  Equation  (1). 

The  statistic  R.  = r.(x)  is  called  the  rank  of  x and  R is  the  vector  of  the  ranks  of 

li  l — 

x.  The  ranks  are  well  defined  only  if  the  probability  of  coincidence  of  any  pair  of  co- 
ordinates equals  zero. 

A simple  linear  rank  test  is  given  by 
N 

S = V c.a(R.)  (2) 

1=1 

where  c.  is  0 or  1 and  a(R. ) are  the  scores  of  the  test.  These  scores  are  functions  of 

l ' l 

c.d.f.  , F,  and  p.d.f.  , f,  unde r the  alternative . 

The  scores  are  generated  by  a score  generating  function  cr(t),  0<t<  1,  as  follows: 
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a(i,  f)  = eU(u£\  f)]  l<i<N  (3) 

where  u^  ^ is  an  ordered  sample  from  the  (0,  I)  uniform  distribution.  Generally,  the 
cp  function  form  depends  on  the  test  parameter,  i.e.  , a shift  parameter  or  a scale 
parameter.  For  the  shift  parameter 

f 1 (u) ] 

cp(u,  f)  = s 0 < u < 1 (4) 

f[F'‘(u)] 

where  f1  is  the  derivative  of  f and  F *(u)  is  the  u-th  quantile.  For  the  scale  parameter 

, f’CF*1(u)3 

c?  . (u,  f)  = -1  -F  ‘(u)  , 0 < u < 1 (5) 

1 f[F_1(u)] 

If  N is  large  and  cp(u,  f)  is  smooth  in  the  neighborhood  of  i/N  + 1,  we  have 

a(i> f)  = * (nTT  ■ 0 (6) 

To  generate  cc(-)  and  a(i,  f)  from  x^,  we  use  an  estimator  shown  in  Figure  1.  This 
procedure  for  estimating  the  scores  fits  well  in  the  framework  of  nonparametric  tests 
and  improves  the  power  of  the  tests  under  varying  noise  conditions  and  in  the  presence 
of  outliers.  The  essential  building  blocks  for  the  estimator  of  Fig.  1 were  developed 
in  References  4 and  5. 


Fig.  1.  Score  estimator  for  linear  rank  tests. 
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B . An  Example 

Since  the  performance  of  each  block  of  Fig.  1 was  investigated  in  Refs.  4 and  5 

yielding  good  to  excellent  results,  this  tends  to  indicate  that  the  score  estimation  pro- 

2 

cedure,  a simulation  program  was  developed  for  the  Van-der-Waerden  test  and  com- 
pared to  the  theoretical  scores  given  by 

1 (_i_) 

\m+  1/ 


a = S> 

i 


1 < i < m 


where 


- 1/2  x 

«(x)  = (2tt)  J exp(-^y2)dy 


The  results  of  the  simulation  study  for  m = 20  are  given  in  Table  I. 

TABLE  I.  Scores  of  the  Van-der-Waerden  test. 
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C.  Comparison  With  Other  Methods 

Hajek^  considers  a class  of  score  estimation  procedures  for  simple  rank  tests. 
In  his  notation,  Hajek  proceeds  to  generate  scores  as  follows. 

For  N > I,  consider  the  simple  linear  rank  test 


S = y c . a (R  ) 
n lj  m n m 

i=  1 


where  the  ranks  R • • • , R are  derived  from  a sequence  of  observations  Y Y 

ni  nn  n 1 r 

and  a are  the  scores.  S may  be  written  as 

n n ' 


S (Y  Y ) = y c . a ( V u(Y  . - Y .)) 

n m nn  — / m n \ Lt  ni  nj  / 

j=l 


i=  1 

where  u(  - ) is  the  unit  step  function. 

Consider  the  increment  of  S if  the  observables  Y . 's  are  shifted  by  a random 

n ni  7 

multiple  of  some  numbers  d i.e.  , consider  the  statistic 

S"  = S (Y  , + A d . , • • • ,Y  + A d ) 
n n nl  n nl  nn  n nn 


Let 


dn.  = 1/VTT  (1<  i < if]) 


= 0 ( ^ < i < n) 

and  let  A = A (Y Y ) be  any  statistic  such  that 

n n 1 n 7 

A (XY.+u,  ...,XY  +u)=  XA  ( Y . , • ■ ■ , Y )-oo<u<oo,  X>0 
n 1 n n 1 n 

for  which 

Plim  A (x.  . . . x ) = b.  < oo  ( 1 < j < k) 


f=f. 

J 


n i n'  j 


where  P denotes  probability  and  T denotes  the  j-th  p.d.f.  Hajek  suggest  the  following 
selections: 
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A = E * 
n _ n 


(n+ 1 -j  ) (j  ) 


m ib 

K 

m 


- a • 0 < a - 2 / 


<*<l) 


hj  = E[F;V-a»-F:W 

Sn,  = Sn,(*i ‘V  * 1 <>  ±k 


S . = S _(x  + A d _ , 
nj  nj  1 n mi 


- - , x + A d ) 
n m nm 


The  ratios 


S . - S . 

i = _Ei EL 

nj  j 

. varS  - 
\ nj 

are  then  used  for  score  estimation 

It  is  obvious  that  the  method  suggested  by  Hajek  is  more  complicated  than 
method  outlined  in  Section  A.  The  processing  time  «s  increased  significantly  because 
of  the  ranking  required  and  the  choice  of  the  A statistic  is  not  obvious  for  a given  ap- 
plication.. In  addition,  the  procedure  suggested  by  Hajek  is  not  robust,  e.g.  , if  the 
samples  are  lightly  contaminated  the  score  wall  be  taken  as  those  for  the  pure  distri- 
bution reducing  the  power  of  the  test.  If  the  contamination  of  the  samples  is  severe, 
the  computation  of  l . is  different  for  the  two  underlying  distributions . It  is  not  clear 
how  to  modify  Hajek's  procedure  and  the  approach  fails  completely 

The  procedure  developed  here  is  easy  and  inexpensive  to  implement,  dees  not 
require  ranking  and  storing  of  samples  and  preserves  the  power  of  the  test  even  if  the 
samples  are  contaminated. 
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EXTRAPOLATION  OF  BAND- LIMITED  SIGNALS  USING  A STOCHASTIC 
APPROXIMATION  ALGORITHM 

I.  Kadar  and  L.  Kurz 

Recently  there  has  been  renewed  interest  in  extrapolating  band-limited  signals 
and  computing  their  Fourier  transform  from  noisy  measurements  which  can  only  be 
taken  over  a finite  segment  of  the  signal.  1 5 This  problem  is  of  great  importance  in 
spectral  analysis^  7 in  optical  or  antenna  systems  where  one  desires  to  extend 

the  resolution  of  the  instrument  beyond  the  diffraction  limit  by  super-resolution^’  7 
or  in  image  processing  where  the  object  (or  the  entire  image)  is  reconstructed  by 

4 

extrapolating  a truncated  observed  portion.  As  was  indicated  by  the  authors  previ- 

2 

ously,  the  approaches  based  on  the  prolate  spheroidal  wave  functions  and  the  use  of 
appropriate  projection  in  the  Hilbert  spaceJ  lead  to  improperly  posed  problems.  In  the 
absence  of  noise,  these  algorithms  become  unstable  in  the  limit.  This  problem  can  be 
circumvented  by  the  use  of  appropriate  stopping  rules.  On  the  other  hand,  in  the 
presence  of  noise  large  errors  occur  for  finite  number  of  iterations  and  in  most  practi- 
cal  situations  the  algorithms  will  diverge.  In  a recent  paper,  the  authors  introduced 
a stable  algorithm,  based  on  the  Gladyshev  theorem,  which  guarantees  the  convergence 
of  the  extrapolation  or  restoration  algorithm  with  probability  one  and  in  the  mean- 
square  sense  even  in  the  presence  of  corrupting  severe  noise  outliers.  Another  version 
of  the  stochastic  approximation  algorithm  is  presented  here.  The  approach  uses  ex- 
pansions into  prolate  spheroidal  wave  functions  as  in  Ref.  2,  yet  guarantees  convergence 
with  probability  one  and  in  the  mean -square  sense  in  the  presence  of  noise.  The  new 
approach  has  the  potential  of  being  generalized  to  two-dimensions,  including  the  use  of 
other  orthogonal  function  sets,  thus  opening  the  methodology  to  a broad  class  of  problems 
in  image  reconstruction. 

A.  An  Algorithm  Based  on  Stochastic  Approximation 

Using  the  approach  developed  in  Ref.  2 and  applying  a vector  Robbins -Monro 

strochastic  approximation  (RMSA)ij  algorithm  directly  to  estimate  the  coefficients  in 

the  prolate  spheroidal  expansion  method  in  a manner  similar  to  the  one  used  for  esti- 

q 

mation  of  density  functions,  one  arrives  at  the  new  algorithm. 

The  prolate  spheroidal  expansion  method  in  Ref.  2 yields  for  f(t)  in  terms  of 

g(t) 

00 

= £ gk^k'V0 

k=  1 

whe  re 
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!k‘  ('Ak)i  8(tlVt)d* 


r ^ ^(t)  "i 2 

lim  j ig(t)-  ^ gk J dt-»0 

N -*  oo  _ j k=  1 v^k 

and  {<J>k(t)  } are  the  prolate  spheroidal  wave  functions  generated  by 

/ *k(T>  “it"; '-V-r dT  1 W ■ 

- T 

Now,  if  g(t)  is  corrupted  by  noise,  i.e.  , 
y(t)  = g(t)  + v(t) 

one  can  consider  the  problem  as  if  g(t)  is  unknown  (or  cannot  be  directly  observed)  and 
we  estimate  an  approximation  to  y(t)  using  a sequence  of  independent  samples  t^, 
k=  1,  2,  • • A block  sample  of  y(t)  is  approximated  as 


y.(t)  = y \ *k(t)  3 a i(t) 

k=  1 

jjt 

where  i)i  (t)  = <j>.  (t)/V  X , a is  an  n-vector  of  unknown  coefficients.  The  problem  is  to 

ji,  K K K 

find  a"  which  minimizes  the  integ ral-squa re -error  (ISE)  criterion  which  is  given  by 

1(a)  = J (£(t)  - aT  ji(t))2  dt 
n 

whe  re 

m 

dt  s IT  dt.  , fl  = ( -T,  T) 
k=l  K 

# . 10 
The  value  of  a = a which  minimizes  this  criterion  is 

a*  = [/ jjr (t)  J_T(t)dtj  j 2U)£(t)dt 

L n n 

)j{ 

To  evaluate  a numerically,  using  the  RMS  A,  one  forms 

SfcU  = ik-'A/klVI^.V 

where  A is  a diagonal  matrix,  the  function  Y(a^  t)  is  chosen  so  that 
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5a 


-j  t('n)£(ri)dTi  + r j _f  (T1)  j/T(n)  dr)  Ja 

u n 


m 

•n  = col(r|  j , ....  nm)  , dq  = TT  dri . 

i=  1 

The  function  Y(a,  t)  satisfying  the  above  condition  is  given  by 


Y(a,  t)  = -jS(t)  + Ka 

where 


P(t)  = 


iji_(t)  if  t e n 
o if  t ^ n 


and 

K = / i(t)  j[T(t)  dt 

n 

which  can  be  evaluated  directly  from  the  knowledge  of  the  eigenfunctions.  The  vector 

RMSA  algorithm  can  be  written  as 

^+1  = + (A/k)  (£(tk)  - Kak) 

which  converges  in  the  mean-square  sense  and  w.p.  1. 
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INFLUENCE  OF  NONLINEAR! TIES  ON  THE  PERFORMANCE  OF  PARTIAL  RESPONSE 
SYSTEMS 

M.  Kavehrad  and  L.  Kurz 

In  this  report,  the  influence  of  nonlinearities  on  PAM  systems  using  partial  re- 
sponse techniques  will  be  investigated.  The  report  represents  an  extension  and  gener- 

1 2 

alization  of  previous  results.  ’ 

Such  an  investigation  permits  a meaningful  study  of  practical  systems  because 
even  if  nonlinearity  is  not  added  to  the  system  to  suppress  the  effect  of  impulsive  noise, 
all  real  systems  have  a finite  dynamic  range  of  linear  operation  and  act  as  a soft  lim- 
itor  with  a high  saturation  level. 

In  particular,  the  effect  of  nonlinearities  on  intersymbol  interference  is  consider- 
ed. The  generalization  of  the  equalizer  design  in  the  frequency  domain  is  presented 
which  includes  the  constraints  of  suppression  of  performance  degradation  due  to  non- 
linearities.  Several  partial  response  signals  which  are  of  practical  interest  are  con- 
sidered and,  in  each  case,  the  receiving  filter  (equalizer)  characteristics  assuming 
Square  Root  of  Raised- Cosine  as  the  system  input,  are  obtained.  Finally,  a comparison 
between  a standard  pulse- amplitude- modulation  (PAM)  system  and  a partial  response 
system  in  the  presence  of  nonlinearities  is  given. 

A.  Suppression  of  Impulsive  Noise  by  Addition  of  Nonlinearity  in  Partial  Response 
Systems 

The  problem  of  suppressing  the  large  excursions  of  the  impulsive  noise  by  addi- 
tion of  a preselected  nonlinearity  (naturally  there  is  some  nonlinearity  in  any  system 
over  which  one  has  no  control)  is  of  great  interest  in  the  study  of  data  transmission 
systems  . 

Impulsive  noise  is  usually  characterized  by  high  amplitude  bursts  in  relatively 
short  intervals  and  the  distribution  of  the  amplitudes  have  heavier  tails  than  can  be  rep- 
resented by  a Gaussian  distribution.  During  these  intervals  the  noise  is  assumed  to  be 
a sample  function  of  a random  process  with  a heavy-tailed  distribution  of  various  types 
(Cauchy,  generalized  exponential,  etc.). 

Although  impulsive  noise  is  present  for  only  short  intervals  of  time,  a major 
part  of  the  error  rate  in  the  system  may  be  caused  by  it. 

For  example,  in  telephone  networks,  most  of  the  errors  in  relatively  low  speed 
transmission  systems  is  caused  by  impulsive  noise  which  usually  arises  from  electrical 
contacts  that  cause  a transient  effect  in  the  switching  and  signaling  equipment. 

One  way  to  suppress  impulsive  noise  is  to  clip  or  limit  the  received  waveform 
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by  addition  of  nonlinearity  before  the  receiving  filter.  Different  types  of  nonlinearities 
have  been  used  for  this  purpose. 


A useful  representation  of  a large  class  of  such  nonlinearities  is  the  third  power 

3 

polynomial  device  expressed  by  y=ajX  + a^x  shown  in  Fig.  1,  where  by  proper  selec- 
tion of  the  constants  a ,a  , the  curve  resembles  the  optimal  nonlinearity  found  by 
3 1 2 

Rappaport  and  Kurz . 


Fig.  1.  Third  power  nonlinearity. 

It  should  be  observed  that  the  shape  of  the  nonlinearity  is  not  of  critical  impor- 
tance; the  main  problem  is  just  to  suppress  the  large  excursions  of  burst  noise. 

B.  System  Description 

The  transmission  system  model  considered  in  this  report  is  illustrated  in  Figure 
2.  Pulse  Amplitude  Modulated  (PAM)  signals  are  passed  through  a channel  C(u>).  It 


Noise 


Fig.  2.  Transmitting  and  receiving  system  model 
with  inter  symbol  interference. 

will  be  assumed  that  the  channel  has  an  ideal  low-pass  characteristic;  even  if  it  does 
not,  we  may  still  lump  the  more  general  response  together  with  the  transmitter. 
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Furthermore,  it  is  not  necessary  to  be  restricted  to  baseband  transmission.  The  ef- 
fective noise  is  assumed  to  appear  at  the  input  to  the  receiver,  which  consists  of  a 
memoryless  nonlinearity  followed  by  a linear  filter  (equalizer),  G^(u).  In  a linear 
system,  intersymbol  interference  may  be  considered  independent  of  addition  noise,  but 
if  the  received  signal  is  passed  through  a memoryless  nonlinearity,  distortion  will  take 
place,  i.e.  , a change  in  the  expected  value  of  the  received  waveform.  This  distortion 
is  a function  of  the  transmitted  signal,  the  noise  and  the  nonlinearity. 

As  an  illustrative  example,  suppose  one  desires  that  a single  pulse  of  the  form 

sin(fft/T) 

(irt/T) 


be  received  in  a linear  system  corrupted  by  white  Gaussian  noise. 


The  optimal  transmitting  and  receiving  filters  are  identical.  In  this  case 


Gt(u,)  = Gr(ui)  = 


T I w I < tt/T 
t 0 | oj  | > tt/T 

If  the  effect  of  the  third  power  device  on  the  single  pulse  is  investigated,  the  input  sig- 
nal to  nonlinear  device  is  of  the  form 


sin(irt/T)  . 

x(t) 

where  q(t)  is  Gaussian  noise  of  zero  mean  and  a2  variance,  the  expected  value  of  the 
output  waveform  is 


E{  y (t)}  = A 


sin(TTt/T) 

(irt/T) 


+ a 


, s in(  tt  t/T)  | 
2l  (it  t/T)  J 


3 


(1) 


where 


A = a . + 3 a2  a (^) 

1 x 2 

so  the  received  signal  is  distorted. 

Clearly,  the  cubic  term  in  Eq.  (1)  contributes  to  intersymbol  interference.  Thus 
by  proper  selection  of  a value  for  a^.  a relatively  small  amount  of  intersymbol  inter- 
ference can  be  introduced,  but  in  general,  even  with  the  proper  choice  of  a^,  the  non- 
linear distortion  may  not  be  neglected. 

C,  A Frequency  Domain  Receiver  Design  Procedure 

In  this  section,  a frequency  domain  design  of  the  equalizer  is  given.  Again  the 
Nyquist  problem  is  considered  using  partial  response  technique,  while  the  intersym- 
bol interference  due  to  the  cubic  term  is  taken  into  account  in  the  design  problem.  The 
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actual  filter  frequency  response  is  shown  to  represent  a more  practical  solution  and  has 
superior  noise  performance  compared  to  a standard  PAM  system. 

The  filter  can  be  designed  for  an  input  signal  which  is  a sum  of  the  linear  and  cu- 
bic terms.  This  has  one  major  drawback:  the  relative  amplitude  of  the  linear  and  cu- 
bic terms  are  not  invariant,  they  are  a function  of  the  nonlinearity,  signal  amplitude 

5 

and  noise  parameter.  By  using  the  method  initially  introduced  by  Gibby  and  Smith  in 
derivation  of  spectral  requirement  associated  with  the  Nyquist  problem,  it  was  shown 
that 


Er„("|-t£v 


- jukT 


(3) 

(4) 


where  R(u) ) is  the  overall  frequency  response  and  r(t),  the  corresponding  time  response 
and  r^  = r(kT)  for  k an  integer. 

At  the  output  of  the  nonlinearity,  the  linear  part  of  the  signal  has  a spectrum 
G^(u>)  and  the  cubic  part  has  a spectrum  Cg(ui),  the  receiver  filter  frequency  response 
is  shown  by  Cj^(u),  then  for  a partial  response  system  one  obtains 

oo 


E C^n)(u)G^n)(u)  = T 

n = -oc 

l G(Bn)(u)GLn)(u)  = T 


l 


rke 


•jukT 


n = -oo 


R 


rke 


-jukT 


(5) 


(6) 


where 


CA)(U)  = ga(u  +^tL) 


,(n) 


(u)  = G„(u  + 


2nir 


| u|  < it  /T 
I u I < tt/T 


Subject  to  Eqs.  (5)  and  (6)  one  desires  to  minimize  output  noise  power  at  the  receiver, 
i.  e.  , 

2 No  r"/T  y | _(n) . , 2 

ffn  = 2 7 J /T  ^nlCR  <u)  I du 

- TT  / 1 

Thus,  for  each  value  of  u in  | u [ < n/T  one  obtains  the  stationary  point  of 


J = 


L 


G^lu)!2  +X(u)gJJ')(u)gJJ,)(u)  + v(u)G(3)(u)gJJ,)(u)} 


(7) 
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G r(-)4  £ cj!0^)  (14) 

n = -oo 

Equation  (13)  represents  the  design  formula  for  a general  pulse  amplitude  modulation 
system  equalizer,  including  partial  response  signals.  In  the  next  section,  specific  de- 
sign problems,  involving  the  duobinary  and  Kretzmer's  signals  are  discussed. 

For  simplicity,  in  subsequent  use  of  Eq.  (13)  it  is  assumed  that 


A ■ ZkVJukT 


where 


T { 1 - ■ 


Si  C^lu)  [C^U)] 


[cfW 


G^n,(u)|2 


Sj  I Gb  (u)l 

Sj  [G^(u)]*G^(u)]  n) 

Si  I CB)(u)  I 2 A 


C(An)(u,(G(Rn)(u)]* 


Sni^.lG^u,]^ 


^ilCA)(u)l2  -jrc(n),  n* 

Si[G^)(u)]::;  G^(u)i  B 

c(n)  (u)]*  Sj^)(u)|2 

B Si^A^l^B^ 


G^(u)|2 


D.  Nonlinearity  and  Partial  Response  Systems 

In  this  section,  generalizations  of  some  partial  response  signals  which  are  of 
importance  in  practice  are  considered.  The  generalization  involves  the  inclusion  of 
nonlinearities  in  the  system  in  order  to  suppress  impulsive  noise.  Consider  a duo- 
binary system  (an  example  of  a simple  partial  response  signal)  for  which 


rR  = r(kT)  = 0 


fj  = 1/2 


k / 0,1 


where  r^  is  the  sampled  form  of  the  overall  time  response.  By  using  Eqs.  (5)  and  (6) 
in  a duobinary  signaling  scheme  one  obtains 

S G<an)<u>  GRn)<U>  « f + e'jUT>  d7) 
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and  by  following  the  same  procedure  as  in  the  last  section,  results  in 


S gb)(u)  gr )(u)  = 7 a + e'juT) 

n = -°° 


CRn’l“l  = 


1 * 


SiCA<“)  |c(B>(u>l  I,  T (1  -juT  . (n)  , 

I1|c«>)|*  I'2"*'  I|Ca'u|1 


z„ 


4n)( 


T(1- 


<£><«>  [c«B">(u)]* 


ZjlCAl|u>^  I (,)  . -juT 


L <£’<■»  ■ i<4"W 


Zt  l°»V“l|2 


„(n).  .,2 
CA  <u,l 


(18) 


(19) 


Equation  (19)  represents  the  receiver  filter  response  for  a duobinary  system. 

At  this  point,  it  is  useful  to  discuss  briefly  the  choice  of  the  square  root  of  a 
raised  cosine  signal  as  an  input  to  the  pulse  amplitude  modulation  system. 

The  raised-cosine  characteristic  as  commonly  used  in  the  literature  on  inter- 
symbol interference  is  given  by 


X(- 


T I1  - Sint-2T  (u,‘  t>1  I 


o < U>  < Y (1  + a) 

Y (1  - a)  < «<■=;  (1  + a) 


where  a is  the  excess  bandwidth  parameter  which  is  the  amount  of  bandwidth  used  in 
excess  of  the  Nyquist  bandwidth  divided  by  the  latter  This  a is  a variable  less  than 
one,  a - 0 corresponds  to  the  Nyquist  bandwidth,  and  a = 1 corresponds  to  twice  the 
Nyquist  bandwidth. 

The  impulse  response  of  the  raised  cosine  is 


x(t)  = 


■ / it  T , 
sin(  — ) 


,o»  t , 
cos  ( — =— ) 


it  t ,,  .22  /_2> 

-=r  (1  • 4o  t /T  ) 


which  decreases  asymptotically  as  1/t^  except  for  a = 0. 


from  passing  the  linear  and 


08 

Ui 

o 06 


0 


Fig.  4. 
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which  means  that  the  contribution  of  nonlinearity  to  intersymbol  interference  is  assum- 
ed to  be  zero.  Considering  this  new  constraint  and  following  the  same  procedure  as 


with  Eqs.  (5)  through  (13),  one  obtains 


cr'“»  = E„ 


s„|  Knw- 


V C,n,l  llG(n,l  11*  i . 

A '“«CB  >]  - [jjcjjVn'c'g'ooJ 


<£><«)  I2 


Equation  (20)  represents  the  receiver  filter  (equalizer)  frequency  response  for  a duo- 
binary system  when  the  contribution  of  nonlinearity  to  intersymbol  interference  is 
assumed  to  be  zero  (see  Figure  5).  For  comparison  purposes,  the  equalizer  character- 
istic for  a standard  P.  A.M.  system  is  also  included.  It  can  be  seen  that  the  former 
has  superior  noise  performance  characteristic  when  compared  to  the  latter. 
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In  this  example  a - . 5 and 

Vt 

CA(u)  = r_ 

jT/2  • I 1 - sin[ T(u-  tt  / T)]  f 


0 < u < tt/2T 
tt  / 2T  < u < 3tt/2T 


Jt  »<“<Tf 

T/2  • | ] - sin[T(u-ir/T)]  | JT/2  ■ |(1  - sin)  T(u-tt  /T)J  | < u < 


Following  the  same  procedure  as  for  the  duobinary  system,  the  frequency  response  of 
the  equalizer  for  the  systems  using  other  classes  of  partial  response  signals  due  to 
Kretzmer  can  be  obtained.  For  partial  response  signal  with  (1  + D^)  , where  = 
e ^Ut,  the  corresponding  receiver  filter  response  becomes 


GR(u)  = £nGR>(u)  • [4  c°s2[uT/2]]  (21) 

s 

where  G (u)  is  given  in  Equation  (16). 

Ks 

The  equalizer  frequency  response  of  Eq.  (21)  is  shown  in  Figure  6.  The  time 


Fig.  6.  The  receiving  filter  frequency  response  for  partial 
response  system  with  (1  + D^J^. 
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response  at  the  output  of  this  filter  due  to  linear  and  cubic  terms  are  shown  in  Figure 
7.  For  partial  response  signal  with  (2  + D^- 

gr(u)  = ZnC(Rn,(u)  • {2  + [cos(uT)  - cos(2uT)]  - j(sin(uT)  - sin(2uT))  } (22) 

s 

which  is  shown  in  Figure  8.  The  output  time  responses  of  this  filter  are  shown  in 
Figure  9 . 


Fig.  8.  The  receiving  filter  frequency  re-  Fig.  9. 
ponse  for  partial  response  system 
with  (2+  D,  - D?  ) . 


Output  time  response  of  a partial 
response  system  with  (2+ 
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, . „2 , 

For  partial  response  signal  with  (1  - D^) 

CR(u)  = ^nc[jl)(u)  • 12  sin  (uT)]  (23) 

which  is  shown  in  Figure  10.  The  output  time  responses  of  this  filter  are  shown  in 

2 4 

Figure  11.  For  partial  response  signal  with,  (- 1 + 2D  - D ) 


Fig. 


I 


10.  The  receiving  filter  frequency 
response  for  partial  response 
signal  with  (1  - D^). 

Gr(u)  = Y,n  GR  ^ (u)  • (4  sin2  (uT)] 
s 


Fig.  11.  Output  time  response  of  a 

partial  response  system  with 

(1  - D?). 


(23) 


which  is  shown  in  Figure  12.  The  output  time  responses  of  this  filter  are  shown  in 
Figure  13.  Comparing  Figs.  6,8,  10  and  12,  it  can  be  observed  that  the  last  model 
shown  in  Fig.  12  has  only  one  spectral  null  within  its  pass-band,  therefore  from  the 
viewpoint  of  implementation  it  is  the  best  one.  For  comparison  purposes,  the  output 
time  response  of  the  standard  pulse  amplitude  modulation  system  with  receiving  filter 
frequency  response  shown  in  Fig.  5,  is  given  in  Figure  14.  In  this  system,  it  was 
assumed  that  the  intersymbol  interference  caused  by  the  nonlinearity  in  system  is 
zero.  Comparing  Fig.  14  with  Figs.  7,9,11  and  13,  it  can  be  observed  that,  employ- 
ing a controlled  amount  of  intersymbol  interference,  results  in  well  damped  output 
time  responses,  which  helps  in  increasing  the  timing  jitter  immunity. 
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This  is  another  advantage  of  using  partial  response  signals. 
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ROBUSTIZED  SEQUENTIAL  PARTITION  DETECTORS 
H.  S.  Ashtiani  and  L.  Kurz 

In  this  report,  an  attempt  will  be  made  to  robustize  the  performance  of  sequential 
partition  detectors*  against  impulsive  noise.  In  these  detectors,  although  the  test 
statistic  upon  which  the  decision  is  made  is  robust  by  itself  in  the  sense  of  insensitivity 
to  variations  of  the  noise  distribution  in  a severe  impulsive  noise  environment,  the 
contribution  of  the  large  amplitudes  of  the  impulsive  noise  to  the  test  statistic  may  re- 
duce the  effectiveness  of  its  robustness.  In  these  situations  the  performance  of  the 
detector  may  be  improved  by  introducing  regions  of  large  signal  ambiguity  through 
setting  two  upper  thresholds  in  the  detector.  The  samples  that  fall  into  these  regions 
are  assumed  to  have  come  from  the  impulsive  noise  component  of  the  assumed  mixture 
noise,  and  are  disregarded.  The  mixture  noise  is  represented  by  n(t)  = (l-e)n^(t)  + 
tn.(t)  where  n^(t)  and  n^U)  are  the  Gaussian  and  the  impulsive  noise  components  respec- 
tively. The  region  between  the  two  thresholds  in  the  sample  space  is  partitioned  into 
m -Intervals,  the  number  of  samples  falling  into  each  interval  weighted  appropriately 
are  added,  the  sum  is  the  test  statistic  upon  which  the  detector  bases  its  decision.  In 
sequential  partition  detectors  the  decision,  as  towhether  to  accept  or  reject  the  presence 
of  a signal  in  the  received  samples,  is  made  sequentially  using  Wald's  sequential  pro- 
cedure. In  this  procedure,  samples  are  taken  one  at  a time.  The  test  stops  when  the 
received  samples  give  adequate  information  for  making  a reliable  decision.  Wald's 
sequential  procedure , called  the  sequential  probability  ratio  test,  is  optimum  in  the 
sense  of  minimizing  the  average  risk,  when  cost  per  observation  is  constant. 

The  sequential  partition  detectors  are  created  as  the  result  of  applying  Wald's 
sequential  procedure  to  the  m -Interval  partition  detectors,  m -Interval  partition  de- 
tectors are  robust  in  the  sense  that  they  are  relatively  insensitive  to  variations  in  the 
underlying  noise  distributions.  Moreover,  partition  detectors  can  be  made  non- 
parametric  (i.e.  , have  fixed  false  alarm  rate)  by  choosing  the  partitions  under  the 
hypothesis.  The  optimum  property  of  the  SPRT  combined  with  properties  of  m-Interval 
partition  detectors  are  the  motivation  behind  considering  the  SPD  for  operation  in  an 
environment  of  mixed  noise  distribution. 

It  is  assumed  that  the  reader  is  familiar  with  the  material  of  Ref.  1,  including 
its  terminology  and  notation.  In  Section  A a robustized  version  of  the  sequential  parti- 
tion detector,  and  in  Section  B robustized  versions  of  truncated  sequential  partition 
detectors  are  presented  and  their  performance  is  analyzed. 
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A.  Robustized  Sequential  Partition  Detector  (RSPD) 

We  would  like  to  desensitize  the  SPD  to  impulsive  noise.  For  this  purpose,  the 
noise  is  represented  by  a mixture  distribution  model  of  the  form 

F(x)  = (1  -e)  P(x)  + e H(x) 

where  P(x)  is  assumed  to  be  a distribution  having  low  variance  representing  the  back- 
ground Gaussian  noise,  H(x)  is  a large  variance  distribution  representing  large  ex- 
cursions of  impulsive  noise,  and  e,  the  mixing  parameter,  is  assmed  to  be  small, 
that  is,  we  assume  that  the  impulsive  noise  is  infrequent. 

An  intuitively  appealing  way  of  combating  the  large  amplitudes  of  impulsive  noise 

is  to  place  an  upper  limit  on  the  acceptable  sample  value;  in  other  words,  we  may 

limit  the  space  of  observation  on  both  sides  and  disregard  samples  that  fall  beyond 

2 

two  upper  and  lower  thresholds  and  T^.  Rappaport  and  Kurz  explored  this  idea 
and  showed  that  the  rejection  of  samples  that  are  too  large  compared  to  thresholds 
T.  and  Tti  improves  the  information  rate  of  transmission  in  an  impulsive  noise  environ- 
ment.  We  will  use  the  optimal  thresholds  established  in  Ref.  2 as  a means  of  robustizing 
the  performance  of  the  SPD. 

Since  the  impulsive  noise  component  is  infrequent,  we  will  choose  optimal  parti- 
tions for  Gaussian  noise  alone.  As  is  shown  in  Ref.  3,  quantiles  of  the  noise  distribu- 
tion provide  a near  optimal  partitioning  of  the  sample  space  that  can  be  easily  imple- 

4 

mented.  Furthermore,  robust  quantile  estimation  methods  are  available  making  this 
partitioning  quite  attractive  in  the  impulsive  noise  environment  under  consideration. 

RSPD  Formulation 

Let  x,  , x_ , ....  x be  n i.i.d.r.v.  from  the  c . d.  f . F (x),  v=0,l,leta  = 

(a_  = T.  , a,  , • . . a ,,a  = Tlr)  be  an  ordered  vector  of  constants  subdividing  the 

' 0 L 1 m-1  m U & 

real  axis  from  to  into  m intervals. 

Define  n^  as  the  number  of  samples  falling  into  the  interval 

l|a^  | c x^  < a^_,  k = 1,  • • • m,  i = 1,  2,  . . . nj 


Let 


, = P la.  < x.  < a.  "1  = F (a.  ) - F (a.  .) 

;k  vLk-1  l—  kj  v k v k-1' 


th 


V = 0,  1,  represent  the  probability  of  a sample,  xj , falling  into  the  k interval  under 
the  hypothesis  or  the  alternative,  under  the  constraint  that 
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V,  P.  = C - C , and  V n = n -n 
— i k U L <-i  k e 

k=  1 k=  1 

where  n is  the  number  of  samples  that  fall  beyond  the  two  thresholds  T and  T and 

E i-j  u 

are  censored,  C^  = F^T^),  and  C^  = F^(T^). 

Following  the  same  steps  as  in  Ref.  1,  we  form  the  test  statistic 

m p 

T = V b.  n,  , b.  = f n 
n Lj  k k ’ k P.. 

k=l  0k 

Decision  procedure  at  the  receiver  is  as  for  the  case  of  SPD,  i.e.  , 

"Continue  sampling  as  long  as,  in  B = b < < a = in  A 

Stop  sampling,  accepting  hypothesis  if,  < b 

Stop  sampling,  accepting  alternative  if,  > a" 

where 


- L-JL 


and  B = 


as  before. 

Assuming  the  common  case  of  shift  of  the  mean  alternative,  we  have 

p0k  ■ F'V  • F<V.> 

Pik  = F(ak  ‘ V ' F(ak-1  " V - P0k  + ei[f(ak- 1*  ' f(ak)J 

P . A 

b = In  = ln(l+  0.  p—  ) , A = f(a  ) - f(a  ) 
k Pok  1 F0k 

However,  in  order  to  establish  we  have  to  estimate  two  additional  quantiles  F(ag) 

C.  and  F(a  ) = CI7,  as  well  as  f(a.)  and  f(a  ). 

Li  m u u m 

The  performance  measures  are  the  same  as  in  Reference  1. 


bt0(9) 


PR<e>  - -li 


at0(6) 


bt0(9) 


er<"/0>  = crb  b(e  -at J81)  :k — 1 • E(T.)  * ° 
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ER(n/6)  = 


E(Tt)  = 0 


E(T^)  and  o^-  are  given  in  Reference  1. 
i 

m rn  m 

E'Ti»  ■ S bkpk  = I Ak*<ee,  -i  ef)  l iz 


whe  re 


m m 

z\-z  K.,»-«*k>] 


= f(aQ)  - f(aj) 


+ f(a  ) - f(a 

m - l m 

= f(a  ) - f(a  ) 
u m 

tQ(  9)  's  + 1 ar>d  '1  for  0q  and  6 ^ , respectively;  and  for  small  values  of  0 


Thus , 
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EB(n/0) 


ab 


e 


2—  , E(T  •)  = 0 

- mA,  1 

2 k 


1 -j  P 


Ok 


If  0^  = 0 and  0^  = 0^  then 


ER<n/eo)  - 


Ibl 


m A 

-«v]4efs  ft 

k Ok 


(1) 


E^n/Gj)  -• 


111  A 

e.Ki  p! 


m a ■ 

i 

k 


(2) 


B.  Robustized  Truncated  Sequential  Partition  Detectors  (RTSPD) 

In  order  to  have  practical  meaningful  decision  time,  it  is  desirable  to  set  a 

th 

limit  on  the  sample  size,  i.e.  , to  truncate  RSPD  at  the  N^.  stage  of  the  sequential 
procedure.  This  would  lead  to  a truncated  version  of  RSPD.  As  in  the  case  of  TSPD, 
we  choose  gradually  closing  boundaries  instead  of  abrupt  truncation.  The  price  of 
truncation  is  higher  probabilities  of  error  as  in  the  case  of  TSPD. 

RTSPD  with  Sloping  Boundaries  (RTSPD  (SB)) 

Let  RSPD  be  formulated  as  in  Section  except  now  let  the  stopping  boundaries  be 
given  by  a( 1 


) and  b(  1 - jm— ),  with  a and  b as  before.  Following  similar  steps  as 
T T 


for  TSPD  (SB)  (see  Ref.  1),  we  obtain  the  performance  measures  of  RTSPD  (SB)  for 
shift  of  the  mean  alternative  as  follows 


-M  ‘0(e) 


PRT(SB)( 


at0(6)  **  -an/NTt0(e)  - I b [t()(  0)  , 

e E (e  ) - e E (e 


|b|n/N  t (0) 

E (e  ) 

rte 


> 


LRT(SB)*n/6*  = 


1 -ERT(n/0)/Nr 


m a 

9,  [(<ao>  <9e,  FT 

W Ok 


|b|[eat0(e)E-(e-an/NTt0(e) 


> ’ 1 J + aLl 


|b|to(0)  * |b|n/Nxto(0) 
E (e 


)1 
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where  E and  E denote  conditional  expected  values  given  Hq  and  H^,  respectively, 

and  t„(9),  A,  and  P„.  are  as  in  Section  A. 

0 k Ok 


If  0q  = 0 and  B j = 9 ^ , noting  the  fact  that  tgJB)  is  still  + 1 and  - 1 for  9^  and  9 ^ , re- 
spectively, we  obtain 


ERT(SB)*n/ V - 


m A, 


8>K,>  - «•»>]  4 p^+ 


Ok  NT 


(3a) 


ERT(SB)^n^ei^  - 


m A, 


ei[f<*o'-«aj]494rrr  + (t 

k Ok  T 


and 


- PRT(SB)^0O)  - e ^ +a  ERT(SB)^n/e0)/NT) 


T(SB) 


(3b) 


0. 


N^JC.„X  - 1 ‘ PRT(SB)(ei*  -e  (i  + lbl  ERT(SB)*n/0I^NT) 


T(SB) 

RTSPD  with  Curved  Boundaries  (RTSPD(CB)) 

Let  RSPD  be  formulated  as  in  Section  A,  except  now  let  the  boundaries  be  given 


VnT 


i/Nx  . 

' 1 tin 


by  ae  *‘V*'"A  and  be  *“f  *' 1 with,  r,  a constant  representing  rate  of  time -risk  compensa- 
tion and  a,  b,  NT  defined  as  before. 

Following  similar  steps  as  for  RTSPD(SB)  we  find  the  ASN  and  errors  of 
RTSPD(CB)  for  shift  of  the  mean  alternative.  With  0^  = 0 and  0^  = 6 j , the  expressions 


ERT(CB)(n/0O)  - 


m a L 

6l[f'am'-fl‘‘0>]46.ZSp4  + 


k Ok 


N, 


(4a) 


ERT(CB)*n/9  1*  - 


m A, 


0i[f(ao)  - f(»m)]+  i p^7  + r JCT 

k °k  T 
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‘N_.  - PRT(CB)(60)  - e'a<1  + raERT(CB)(n/V/NT) 


T(CB) 


^Nr 


1 * PRT(CB)(01)  - e 


( 1 + r | b | ERX(CB)(n/91)/NT) 


(4b) 


T(CB) 

C.  Performance  Curves  and  Concluding  Remarks 

The  performance  of  the  robustized  sequential  detectors  of  this  report  in  terms 
of  the  average  sample  size  as  a function  of  signal-to-noise  ratio,  when  the  average 
probability  of  error  is  fixed,  are  given  in  Figure  1.  The  two  types  of  errors  are 


assumed  equal,  i.e.,  a.-0-P^/Z.  This  leads  to  symmetrical  stopping  boundaries, 


a=  in  A- -b=  -inB.  Two  error  rates  are  considered,  P = 


01  and  P = . 0001.  The 
e 


mixing  parameter  is  assumed  e = 10%.  For  simplicity,  the  number  of  partitions  are 
assumed  m = 2 and  m = 4.  For  m = 2,  the  optimal  partition  is  at  aj  = 0.  This  choice 
leads  to  a sign  detector.  For  m=  4,  near  optimal  partitions,  obtained  by  equiprobable 
partitioning  of  the  sample  space,  are  selected.  The  impulsive  noise  thresholds  are 


also  assumed  symmetrical,  i.e.  , am  = Ty  = - TL=  - aQ.  Optimal  thresholds  of  Ref.  2 


are  used;  however,  the  choice  of  these  thresholds  has  a negligible  effect  on  the  ASN. 
This  is  because  [ffa^)  - f(a^)]  is  small,  for  any  reasonable  choice  of  impulsive 
noise  thresholds,  compared  to 
2 


m a; 
v k 


k=l 


0k 


This  means  that  protection  against  impulsive  noise  is  achieved  with  no  measurable  in- 
crease in  the  ASN. 


For  the  truncated  RSPD,  a truncation  point  of  N^,-  100  has  been  chosen.  Since 
this  choice  is  much  larger  than  the  ASN,  the  increase  in  the  errors  of  the  first  and 
the  second  kind,  a,  g,  is  negligible  as  can  be  seen  from  Equations  (3b)  and  (4b).  Thus, 
truncation  has  a negligible  effect  on  the  average  probability  of  error. 

For  the  purpose  of  comparison,  the  performance  curves  of  non-truncated  RSPD 
and  truncated  RSPD's  with  sloping  and  curved  boundaries  are  given  on  the  same  graph. 
The  rate  of  decay  r=  2 is  assumed  for  the  boundaries  of  RTSPD(CB).  The  average 
sample  size  for  the  RTSPD  with  curved  boundaries  is  lower  than  the  non-truncated 
RSPD  and  truncated  RSPD  with  sloping  boundaries. 

A comparison  between  the  non-truncated  RSPD  and  its  parametric  competitor, 

RCBDFS  is  shown  in  Figure  2.  Two  error  rates,  P = .01  and  P = .0001  and  two 

c e 

mixing  parameters  e = 5%  and  e = 10%  are  assumed  with  m = 2,  4.  It  is  seen  that  the 


A 
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RSPD  is  more  robust  than  the  RCBDFS.  This  was  expected  since  RSPD  is  a nonpara- 
metric  detector.  Also,  it  is  seen  that  RSPD  is  more  efficient  that  the  RCBDFS  for 
higher  S/N.  The  seemingly  poor  performance  of  RSPD  at  low  signal -to -noise  ratios 
is  due  to  pronounced  errors,  especially  since  low  values  of  m are  considered.  For 
^^"8®  m this  is  eliminated  because  the  SP D approaches  the  most  powerful  parametric 
detector. 

The  robustness  of  SPD  is  also  verified  by  the  fact  that  the  performance  of 
RCBDFS  is  significantly  improved  with  respect  to  CBDFS  while  RSPD  performs  es- 
sentially the  same  as  SPD. 
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A REPRESENTATION  THEORY  AND  ITS  APPLICATION 
R.  Chassaing  and  L.  Kurz 

In  this  report,  a new  representation  theory  is  developed  to  approximate  the  uni- 
variate and  the  bivariate  p.d.f. 's  in  terms  of  orthogonal  polynomials.  The  generalized 
Cauchy  p.d.f.  is  chosen  as  reference  to  yield  a set  of  orthogonal  polynomials.  Unlike 
the  theory  presented  previously  (see  Ref.  1),  the  new  approach  does  not  require  the 
Markov  assumption  for  the  underlying  distributions.  Using  the  results  of  the  new 
representation,  the  autocorrelation  function  at  the  output  of  the  nonlinearity  specified 
in  Fig.  1 of  Ref.  1 is  obtained.  The  representation  methods  of  this  report  are  useful 
in  obtaining  approximate  expressions  for  likelihood  ratios,  in  performing  spectral 
analysis  of  nongaussian  processes  shaped  by  memoryless  nonlinearities  and  related 
problems.  The  procedure  outlined  here  may  be  easily  extended  to  n dimensions. 

A.  One-Dimensional  Representation  Theory 

A basic  method  of  representing  a probability  density  function  (p.d.f.)  is  by  a gen- 
eralized Fourier  Series, 

PD  = 1/ Ai  <J>i(x)  PR(X)  (1) 

where  p is  the  desired  (unknown)  p.d.f.  , p a reference  p.d.f.  and  <f.(x)  are  the  ortho- 
l)  K.  l 

gonal  polynomials  with  respect  to  the  weight  function,  or 

00 

/ 4,  (x)  <j>.(x)  p (x)dx  = 6 . (2) 

-oo  1 J R 

where  6^  is  the  Kronecker  delta.  The  coefficients  A^  can  be  found  using  Eq  . (1)  and 
the  orthogonality  property  Eq.  (2), 

oo 

A.  = / <Mx)  Pn(x)dx  (3) 

1 -oo  1 D 

Since  b.(x)  are  polynomials,  A,  is  expressed  in  terms  of  the  moments  of  p^ . Although 
Eq . (1)  may  or  may  not  converge,  such  expansion  is  still  useful  in  that  only  a few 
terms  lead  to  a satisfactory  approximation  of  an  unknown  p.d.f.  , 

pd  = Yj  A!  V54)  PR(X)  N sma11  (4> 

i=o 

Using  the  approximation  of  Eq.  (4),  only  the  first  few  moments  of  p^  are  required 
(these  moments  can  be  estimated). 

The  reference  p.d.f.  is  chosen  as  the  "generalized  Cauchy  " p . d . f . ^ ' 


COMMUNICATIONS 


373 


y~L,/cr(v  ti/ci 


2r(i/c)r(v)(i  + 


Lu  | x |]c  v + l/c 
v 


where 


-1  , IT3/c)  ,1/2 

p a t r(l/c")  1 (6> 

since  such  reference  p.d.f.  Eq . (5)  can  represent  a wide  range  of  density  functions. 

For  c = 2,  v = 1/2,  p^  reduces  to  the  Cauchy  density,  a p.d.f.  having  infinite  variance. 

The  p.d.f.  of  Eq.  (5)  can  be  used  successfully  to  characterize  impulsive  noise^’ ^ with 

the  parameters  c and  v indicating  the  degree  of  impulsiveness.  Such  p.d.f.  , p can 

R 

also  characterize  background  Gaussian  noise,  since  as  v — °o,  c = 2,  p reduces  to  the 

R 

Gaussian  p.d.f. 


As  a special  case,  the  desired  p.d.f.  p is  chosen  to  be  the  "generalized" 
, , 2 D 


Gaussian  p.d.f. 

PD  = 2 ni/c"j  exP  t-(n  M C»  (?) 

For  c — 2 , Eq.  (7)  reduces  to  the  Gaussian  p.d.f.  , and  for  c = 1,  p^  becomes  the  La- 
place p.d.f.  The  density  of  Eq . (5)  can  also  represent  impulsive  noise  by  chosing 
• 1 < c < .6  (see  Ref.  6),  and  therefore  could  be  used  as  a reference  p.d.f.  for  impulsive 
noise.  However,  the  chosen  reference  p.d.f.  of  the  class  Eq . (5)  represents  a wider 
class  of  distribution  in  modeling  impulsive  noise. 

B.  Evaluation  of  the  Representation  Expansion 

n 

The  polynomials  <|^(x)  can  be  found  using  a method  developed  by  Szego,  namely 


= <D„D„-i> 


where 


C0C1 


C C c, 

n n+1  Zn 
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C . C 
n-  1 n 

0 1 
x x 


"n+1 


“'2n-  1 
n 


and  the  C^  are  moments  of  the  reference  p.d.f.  The  moments  of  the  desired  and  the 
and  the  reference  p.d.f.  are:  for  p 


D 


r(l!±!) 

c 

J2r  " 2r _.l. 

n r (— ) 


and  for  p 


R 

hrr(m-j-  r)T(r  + -|) 

S2r  ~ “ 1.  r,  1 . 

r<m  - — ) n-j ) 

A computer  program  was  developed  to  find  the  approximated  p.d.f.  resulting  from  the 
expansion  Eq.  (4)  for  several  cases  of  reference  and  desired  p.d.f.  's.  The  results 
are  found  in  Tables  I to  IV. 

C.  Two-Dimensional  Representation  Theory 

The  representation  theory  used  in  the  previous  two  sections  can  be  extended  to 
approximate  a two-dimensional  p.d.f.  Such  p.d.f.  can  later  be  used  to  find  the  auto- 
correlation function  at  the  output  of  a nonlinearity.  The  basic  expansion  has  the  form 


N N 

PD(X1’X2)  =PR(X1)PR(X2)  I Z Amn*m(xl,*-(x’) 

m=o  n=o 


n 


(10) 


where  p^fx^.x^)  is  the  desired  two-dimensional  p.d.f.,  *m(xj)  and  <t>  (x^)  are  t^le  two 

sets  of  polynomials  orthogonal  to  p_(x  ) and  p (x  ),  or 

K.  1 K Z 

00 

/ 4>.(x  )0 :(Xj)pR(x  )dx  = 6 

- 00  J J 

00 

ft  u(x2)<$v(x2)PR(x2)dx2  3 6UV 


PR(x  ) and  p^(x^)  are  the  one-dimensional  p.d.f. 's  given  by  Equation  (5).  The  coefficients 
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A can  be  evaluated  using 
rnn  0 


Amn  = ffP  D(X1'X2)  <WX1)  <t>n(x2)  ^1  dx2 


(12) 


Using  pR(x)  given  by  Eq.  (5)  yields  the  same  orthogonal  polynomials  as  for  the  one- 
dimensional case.  Let 


pD(xi.x2)=ae 


-b(x7  + xp 


(13) 


The  coefficients  ^mn  can  be  expressed  in  terms  of  the  moments  of  the  desired 


Normalizing  Eq.  (13)  yields  a = b8/sir  T(x). 

The  c 
p.d.  f. , or, 

Amn  = -f^f*  1 x2  pD*xl’x2*dxl  ***2 

Transforming  into  polar  coordinates, 

„2/s  Zn 


oc 

A = a / 
mn  J 

o 


dr  J cos‘"0  sin“0  d0 
o 


The  first  integral  becomes 

t2L 

2 


r{  (iii+iliZ)®} 


r(s)2irb(~^)8 


2rr 


r-  m n r(^ti)r(^) 

/ cos  0 sin  0d0 


m, n even 


(14) 


(15) 


(16) 


(H) 


= 0 Otherwise 

Letting  N = 4,  Eq.  (10)  becomes 

Pd^I'V  = pR<xl)pR(x2>  ^ +a22  W *2<x2) 

+ a24  *2^X1*  ^4^x2^  + a42  *4*xl*  <*>2*x2* 


+ *44  W *4(x2^ 


(18) 


With  Amn  given  by  Eq8'  (16)  and  (17,<  *i(x)  8iven  by  Eqs.  (8)  and  (9)  and  pD(x)  given  by 

* K 
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Equation  (5).  A computer  program  was  used  to  find  the  approximations  of  several  two- 
dimensional  p.d.f.'s,  as  shown  in  Tables  V and  VI. 

TABLE  V 


Xi  =' 

0 

= .5 

h 

= 1 

• 

X2 

0 

.5 

1. 

1.5 

2 

0 

.5 

1.  1.5 

2. 

0 

.5 

1. 

1.5 

2. 

PD 

.318 

.248  . 

117  . 

034  . 

006 

.248 

.193  . 

091  .026  . 

004 

.117  . 

.091  . 

043 

.012 

.002 

pA 

.159 

. 139  . 

093  . 

049  . 

022 

.139 

.122  . 

084  .045  . 

019 

,093  . 

,084  . 

061 

.033 

.012 

= 1.5 

X1  = 

2. 

x2 

0 

.5 

1. 

1.5 

2. 

0 .5 

1. 

1.5 

2 

PD 

.033 

.026 

.012 

.003 

.0006 

.005  .004  . 0C2 

.0006  .0001 

PA 

.049 

.045 

.033 

.018 

.006 

.02  .019  .012 

.006 

.003 

b = 

s = 

1 in  Desired  p.  d 

.f. 

c = 2, 

v = 

26  in  Reference  ] 

a.d.f. 

TABLE  VI 

X 

1 =0 

^1 

= .5 

= 1. 

x2 

0 

. 5 

1. 

1.5 

2. 

0 

.5 

1.  1.5 

2. 

0 

5 

1. 

1.5 

2. 

Pd 

• 159 

. 14  . 

096  . 

051  . 

02 

. 14 

. 12  . 

08  .04  . 

019 

09  . 

08  . 

058 

.03 

.013 

pA 

.2 

. 15  . 

072  . 

017  . 

002 

. 15 

. 12  . 

07  .03  . 

01 

072  . 

07  . 

06 

.04 

.021 
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D.  Finding  the  Autocorrelation  Function  at  the  Output  of  a Nonlinearity 

Using  the  transform  method/  the  autocorrelation  function  at  the  output  of  a men.- 
oryless  nonlinearity  is 

Rv(tl,t2)  = " 2 / 1 ^ dw  1 / f(w  )dw  M (w  w )M  (w  w ) (19) 

y (2ir  j)  c 1 c £ 8 1 4 n 1 2 

■where  M^-)  and  Mn(' ) are  the  moment  generating  functions  (the  Laplace  transform  of 
the  joint  p.d.f.)  of  the  signal  and  the  noise  respectively. 

o oo 

f(w)  =f  (w)  +f+(w)  = f g(x)  e'WXdx  + f g(x)e'WXdx  (20) 

-CO  o 

g 

From 

f f(w)dw  = f f (w)dw  + f f+(w)dw  (21) 

c c_  c ^ 

The  Taylor  series  expansion  of  log  has  as  coefficients  the  cumulants  of  the 

process , 


w . 
log  M(wrw2)  = J,  Z wi  ^2  TmT 


1=0  j=o 

where  are  the  cumulants.  Equation  (22)  can  be  written  as, 


‘n^l’V  = exPtwlX10  +W2X01  +W1W2X11  + ~Z~  X02^ 

» » . . X.. 


exp[.Z  .Z  w i ^2 iTjV] 

1=0  j=o  J 

i+j>  2 


The  moment  generating  function  of  Gaussian  noise  is 


W1  2 . W2  2 


MG(W1’W2)  = exptwlnl  +w2n2  +wlw2Rn(tl’t2)  +Tffl  +T<r2I  (24) 

where  R (t.,t,)  is  the  correlation  function,  n.  the  mean  and  a2  the  variance.  Note  that 
n i c,  1 i 

the  first  exponential  expression  in  Eq.  (23)  corresponds  then  to  the  Gaussian  noise 
process . 

For  the  case  of  zero  means  and  unit  variances  for  the  Gaussian  noise,  Eq . (23) 
becomes 
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w + w,  + 2 w w « « <)>. . w*  vA 

Mn(wl'W2)  = exp[_i S —1  Z Z - (25 

i=o j=o  ' J ’ 

a 

<t>ij  were  derived  by  Bowen/  however,  there  is  a slight  error  in  his  expression  (his  in- 
dexing in  the  summation).  It  can  be  shown  that  the  coefficients  d>. . are 

ij 


+iJ  = +{  Qt3(«-l)<t>i-<*.jXo 


a=  1 p = : 

a + p>  2 


l<i<2 


Equation  (26)  is  valid  for  i>l,  j^O,  with  the  double  summation  omitted  when  j =0.  The 

univariate  form  of  Eq.  (26)  appears  in  Kendall  and  Stuart.3  The  bivariate  cumulants 

X can  be  expressed  in  terms  of  the  bivariate  moments  C of  the  desired  two-dimen- 
mn  mn 

sional  p.d.f.  Eq.  (13),  (in  this  case  C„  = A of  Eq.  (14)  or  using  Equation  (18). 

mn  mn 

For  a nonlinearity  with  a transform  f(w),  the  autocorrelation  function  of  Eq.  (19) 
reduces  to 


oo  oo  oo  oo  oo 


Am  + n k 
; A p 


V'i-VvZ.ZZ  Z Z hLhR 

1=0  j=o  k=o  m=o  n=o  ■* 


where 


hL  = 2^Tj  / f(w)  wL  ®W  /2<lw 

for  L = k + i +m  and  R = k+j  +n. 

If  the  nonlinearity  is  a hard  limiter 


nv 


L odd 
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A UNIFORM  POWER  SPECTRAL  DENSITY  JAMMING  SIGNAL 
F*  Cassara 

A-  Introduction 

It  is  often  useful  in  electronic  countermeasures  to  transmit  high  power  noise 
over  some  prescribed  band  of  frequencies  in  an  attempt  to  interfere  with  transmissions 
from  an  unfriendly  source-  Since  we  do  not  know  with  any  certitude  the  frequencies  at 
which  the  source  transmits  and/or  receives,  it  is  desirable  to  use  a high  power  signal 
with  a continuous  uniform  power  spectral  density  bandlimited  over  some  frequency 
band  for  the  noise  jammer-  In  this  work,  a technique  is  described  for  generating  such 
a signal  with  flexibility  in  designing  for  its  center  frequency  and  bandwidth. 

The  technique  employed  utilized  Woodward's  Theorem*  which  states  that  the 
spectrum  of  an  FM  signal  with  large  modulation  index  (Prm8^  10)  takes  on  the  same 
shape  as  the  probability  density  function  (pdf)  of  the  amplitude  of  the  modulating 
waveform- 

A block  diagram  of  the  system  used  to  generate  the  noise  jammer  is  shown  in 
Figure  1-  A nonlinear  network  is  used  to  transform  Gaussian  noise  into  a stochastic 
signal  whose  amplitude  has  a uniform  pdf-  The  resultant  signal  is  then  used  to  frequency 
modulate  a carrier  with  large  P-  The  spectrum  of  the  transmitted  signal  will  then  be 
uniform  and  continuous  centered  around  the  carrier  frequency  with  bandwidth  (BW) 
approximately  equal  to  twice  the  peak  frequency  deviation  (Carson's  Rule)-  The  power 
contained  in  the  transmitted  signal  can  be  made  large  by  using  a high  power  FM 
modulator  or  by  using  efficient  nonlinear  RF  power  amplifiers  following  the  modulator- 

B-  Nonlinear  Network 

The  nonlinear  network  required  in  Fig-  1 is  readily  determined  using  the 
techniques  relating  to  transformation  of  a random  variable-  If  x (see  Fig-  1)  is 

2 

assumed  to  be  a zero  mean  stationary  Gaussian  random  process  with  variance  v , 
and  we  desire  y to  be  a zero  mean  uniformly  distributed  process  over  the  normalized 
interval  [-  y ] , then  the  transfer  function  of  the  nonlinear  network  can  be  shown2 

to  take  on  the  form 
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is  well  tabulated*  ^ 


Fig.  1.  Generation  of  a uniform  power  spectral  density  jamming  signal* 

Although  a broadband  nonlinear  diode  waveshaping  network  could  be  designed 

with  transfer  function  proportional  to  Erf(x/fr  ) , for  simplicity  the  constant  current 

X 4 

biased  bipolar  junction  transistor  differential  pair  configuration  was  employed* 

For  such  a network  the  transfer  characteristic  relating  the  output  collector 
voltage  y and  the  applied  input  differential  base  signal  x is  given  by 


y = Er 


aIkR 


[ 1 + tanh  (p?r)] 


'3) 


kT 

q is  the  charge  of  an  electron,  — — = 26m  V 


where  k denotes  Boltzmann's  constant, 

at  room  temperature  (T  = 300°K),  1^  is  the  magnitude  of  the  constant  current  bias 

source,  R is  the  collector  load  resistor,  Ecc  is  the  collectors1  dc  power  supply 
voltage,  and  a denotes  the  transistor  "alpha*  " The  shape  of  this  characteristic  is 
governed  by  the  term  y f 1 +tanh  ( ^ ^m' V ^ ^ * The  rema*-n'-n8  terms  in  Eq*  (3)  deter- 
mine the  magnitude  (peak-to-peak  swing)  and  dc  level  of  the  differential  pair  output 
voltage*  This  transfer  function  has  the  same  general  form  as  the  desired  error 
function  characteristic*  In  fact,  the  reader  can  show  that  for  the  case  of  44m  V 
the  mean  square  error  between  the  Erffx/r^)  and  its  approximating  curve  is  only 
2*  35%  over  the  range  - 4 < 2^m  V — differential  Pai-r  dc  output  voltage  can 

be  eliminated  by  inserting  a dc  level  shifter  or  coupling  capacitor  between  the  non- 
linear network  and  FM  modulator  of  Fig*  1,  or  it  may  be  used  as  a control  to  adjust 
the  center  frequency  of  the  jammer  signal  generated  by  the  FM  modulator* 


C*  Experimental  Results 


In  the  laboratory  a commercially  available  Gaussian  noise  generator  (GR1  382) 
and  an  FM  modulator  (Wavetek  184)  were  used  to  implement  the  system  described  in 
Figure  1*  The  LM3046  integrated  circuit  transistor  array  was  used  for  the  nonlinear 
network*  Its  measured  static  transfer  characteristic  was  in  good  agreement  with 
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Equation  (3)-  A four  pole  maximally  flat  design  Krohn-Hite  variable  low  pass  filter 
was  inserted  between  the  Gaussian  noise  generator  and  the  nonlinear  network  to  con- 
trol the  noise  bandwidth  and  hence  the  modulation  index  of  the  FM  modulator  output* 
The  pdf  of  the  nonlinear  circuit  output  was  monitored  with  a Polytechnic  built  "pdf 
machine*  11  To  insure  a proper  operation  of  the  machine,  density  measurements 
were  initially  made  with  sinusoids,  triangle  waves,  and  square  waves  and  compared 
with  known  theoretical  responses*  Excellent  agreement  was  obtained*  The  pdf  of 
the  GR  noise  generator  was  then  checked  to  insure  it  was  Gaussian  and  zero  mean* 
Typical  responses  of  the  differential  pair  output  voltage  pdf  and  the  corresponding 
oscillogram  of  the  FM  modulator's  output  spectrum  are  shown  in  Fig.  2(a)  and  2(b), 
respectively*  These  were  measured  with  the  Hewlett-Packard  XY  plotter  and 
Nelson-Ross  spectrum  analyzer*  The  pdf  is  uniform  as  expected,  and  the  output 
signal  has  a uniform  power  spectral  density,  as  Woodward's  Theorem  predicts.  The 
experimental  result  of  Fig*  2(b)  reveals  that  the  spectrum  is  centered  around  the 
carrier  frequency  of  200  kHz,  the  free  running  frequency  of  the  FM  generator,  with  a 
bandwidth  of  237kHz*  Theoretically,  the  bandwidth  expected  is  twice  the  peak 
frequency  deviation  which  is  given  by  the  peak-to-peak  swing  of  the  differential  pair 
output  voltage  (Ij^R)  multiplied  by  the  FM  modulator's  sensitivity  K^(kHz/volt)*  Using 
the  experimental  values  I^s  I*  37mA,  R = 2K  ohm,  = lOOkHz/volt  yields  a 
theoretical  BW  of  274kHz*  The  14%  discrepancy  from  the  measured  value  can  easily 
be  accounted  for  by  component  and  measurement  tolerances  and  Carson's  Rule  ap- 
proximations. Increasing  the  value  of  R or  1^  will  increase  the  peak-to-peak  swing 
at  the  differential  pair  output,  and  hence  increase  the  rms  voltage  at  the  FM 
modulator  input  and,  subsequently,  the  bandwidth  of  the  modulator's  output  spectrum* 
This  was  verified  experimentally  in  a quantitative  manner  as  the  numerical  example 
just  presented*  This  feature  provides  a useful  mechanism  for  electronically  or 
manually  controlling  BW*  In  addition  to  varying  the  parameters  R and  1^  , the  FM 
modulation  index  was  varied  over  the  range  10  Prm8  — ^0  by  adjusting  the  band- 
width (fm)  of  the  Krohn-Hite  low  pass  filter  following  the  GR  noise  generator* 
Rectangular  output  spectrums  with  Carson's  Rule  bandwidths  were  experimentally 
obtained  throughout  this  range* 
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Fig.  2(b).  FM  modulator  output  spectrum  (center 

frequency  = 200  kHz;  bandwidth  = 237  kHz; 

Prms  = imPulse  at  right  of  the 

oscillogram  denotes  dc). 
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TRANSIENT  ACQUISITION  BEHAVIOR  OF  THE  CROSS-COUPLED  PHASE-LOCKED 
LOOP  FM  DEMODULATOR 

F.  A.  Cassara,  H.  Schachter  and  G.  Simowitz 

In  a previous  work^  a novel  cross -coupled  phase -locked  loop  (PLL)  FM  de- 
modulator capable  of  suppressing  co-channel  and  adjacent  channel  interfereres  was 
derived  using  maximum -a-posteriori  estimation  techniques.  Experimental  results 
demonstrating  such  capability  even  in  the  presence  of  strong  input  gaussian  noise, 
moderate  fluctuations  in  the  received  signal  amplitudes  and  multiple  interferer  environ- 
ments were  also  presented.  The  block  diagram  of  this  novel  FM  detector  is  shown  in 
Figure  1. 


dc  gain  = Gj 


RECEIVED  signal* 
DESIRED  SIGNAL  + 
INTERTERER 


DEMODULATED  _ 
OUTPUT  #1  ~ ***  1 


DEMODULATED- 
OUTPUT  *2 


dc  gain  = G 2 


Fig.  1.  Cross  coupled  PLL  FM  demodulator  for 
suppression  of  interchannel  interference. 

The  principle  of  operation  can  be  described  as  follows:  assume  the  input  signal 
consists  of  a frequency  modulated  carrier  plus  a frequency  modulated  interferer. 
Phase -locked  loop  (PLL)  No.  1 locks  on  to  and  tracks  (by  the  capture  effect)  the 
stronger  of  the  two  received  FM  signals  but  its  voltage  controlled  oscillator  output 
signal  (VCO  No.  1)  lags  by  approximately  90°.  An  additional  90°  phase  shift  is  intro- 
duced by  phase  shifter  No.  2 so  that  the  signal  appearing  at  phase  shifter  No.  2's  out- 
put is  180°  out  of  phase  with  respect  to  the  stronger  received  signal.  By  proper  ad- 
justment of  the  gain  constants  of  summer  No.  2,  the  stronger  received  signal  is 
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cancelled,  leaving  only  the  weaker  of  the  two  received  FM  signals  at  the  input  to  PLL 
No.  2.  The  instantaneous  phase  of  VCO  No.  2 output  signal  tracks  the  instantaneous 
phase  of  the  weaker  signal  but  lags  by  90°.  An  additional  90°  phase  shift  is  introduced 
by  phase  shifter  No.  1 producing  a signal  at  the  output  of  phase  shifter  No.  1 which  is 
180°  out  of  phase  with  respect  to  the  weaker  of  the  two  received  FM  signals.  The 
weaker  signal  can  thus  be  cancelled  at  summer  No.  1 leaving  only  the  stronger  signal 
appearing  at  the  input  to  PLL  No.  1.  Since  this  novel  detector  has  two  separate  out- 
puts --namely,  the  outputs  of  the  individual  phase-locked  loops,  it  possesses  the 
capability  of  demodulating  both  the  stronger  and  the  weaker  received  signals  even 
though  they  may  be  co-channel  and  share  the  same  frequency  band.  This  is  a task 
impossible  with  any  of  the  conventional  FM  detectors  since  they  all  obey  the  well  known 
capture  effect. 

In  order  to  determine  the  PLL  parameters  which  insure  successful  acquisition 
and  separation  of  the  two  received  FM  signals,  a theoretical  and  experimental  analysis 
of  the  transient  acquisition  behavior  of  the  system  was  conducted.  Such  an  analysis 
was  constructed  with  the  following  parameters  in  mind: 

(1)  phase -locked  loop  order 

(2)  relative  bandwidth  of  the  two  phase -locked  loops 

(3)  amplitude  ratio  of  the  two  received  FM  carriers 

(4)  frequency  deviation  of  the  two  received  carriers 

(5)  FM  modulation  index  of  the  individual  carriers 


The  normalized  defining  coupled  nonlinear  differential  equations  for  the  novel 
FM  detector  of  Fig.  1 are: 


dt 


= Otj  [sin  (4^-  cpj)  + T|  sin(4>2  - cpj)  - Ti  sin(cp2  - cpj)]  *hLJ(T)  (1) 

= a 2 [sin(i(jj  - cp2)  + rising  - cp2)  - sin(cp1  - cp2)]*  hL2(T)  (2) 


where 

and  4<2  denote  the  phase  modulation  of  the  stronger  and  weaker  received 
FM  carriers,  respectively. 

cp^  and  cp2  represent,  respectively,  the  phase  modulation  of  VCO  No.  1 and 
VCO  No.  2. 
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a,  = 


where  denotes  the  static  hold-in  range  of  PLL  No.  1 and 


denotes  some  selectable  "normalization  frequency" 


= g-  a ^ where  and  are,  respectively,  the  dc  gains  of  the  PLL  low 
pass  filters. 


T|  denotes  the  ratio  of  the  weaker  to  stronger  carrier  amplitudes. 

Digital  computer  solutions  of  these  equations  were  obtained  for  the  cases  of  (a) 
first  order  loops  with  CW  interferers,  (b)  first  order  loops  with  a frequency  offset 
CW  interferer,  (c)  second  order  loops  with  a frequency  off-set  interferer,  (d)  second 
order  loops  with  a frequency  modulated  interferer,  and  (e)  second  order  loops  with 
multipath  interference.  For  each  case  mentioned  the  range  of  loop  parameters,  i.e., 
the  "stability  region"  over  which  the  cross -coupled  PLL  demodulator  can  successfully 
separate  and  demodulate  the  two  received  co-channel  signals  was  determined.  Such 
stability  regions  provide  useful  design  rules.  Figure  2 reveals  a typical  result  for  a 


Fig.  2.  Transient  acquisition  stability  regions  of  the  cross 

coupled  PLL  FM  demoluator  for  first  order  loops  with 
CW  interferer. 
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first  order  PLL  design  with  a single  CW  interferer  located  at  the  center  of  the  IF  fre- 
quency band,  i.e.,  no  offset.  The  weaker  signal  was  frequency  modulated  by  a sinusoid 
with  an  FM  modulation  index  equal  to  10.  The  PLL  loop  parameters  and  shown 
in  Figure  2 were  normalized  to  the  modulation  frequency,  of  this  sinusoid,  i.e., 

was  selected  to  be  <*>rn2*  ^or  success*ul  separation  and  demodulation,  PLL  No. 

2's  bandwidth,  a^,  must  be  designed  to  lie  above  the  solid  curves  of  Figure  2.  Since 
the  hold-in-range  or  equivalently  the  bandwidth  of  each  PLL  is  directly  proportional 
to  the  received  signal  amplitude,  the  smaller  the  value  of  T)  the  larger  we  must  design 
as  indicated  in  the  computer  solution.  For  a fixed  T) , the  minimum  value  of 
occurs  when  Oj  = 0.  Since  the  hold-in-range  of  PLL  No.  2,  ria^  must  be  at  least  as 

large  as  the  peak  frequency  deviation,  gw  ,,  we  have  a,  .....  = gw  ,/ti.  The  results 

m l me.' 

shown  in  Fig.  1 reveal  s^8ht^y  lower  than  this  value.  This  can  be  explained 

by  the  fact  that  each  PLL's  output  was  filtered  on  the  computer  by  a four-pole  maximally 
flat  designed  post  detection  low  pass  filter  with  normalized  3 dB  bandwidth  = 2 rad/sec. 
Such  a filter  reduces  the  high  frequency  distortion  present  in  the  sharp  pulses  occurring 
when  the  PLL  begins  to  cycle  slip.  Experimental  results  have  also  been  obtained  to 
corroborate  selected  theoretical  transient  acquistion  stability  regions.  These  results 
are  also  shown  in  Fig.  1 for  the  case  of  T)  = O.^^Such  results  were  obtained  on  an 
experimental  model  designed  to  accommodate  sinv^oidal  carriers  at  frequencies  of 
200  kHz  with  FM  modulation  index  of  10.  The  hold-in-ranges  of  PLL  No.  1 and  No.  2 
and  were  varied  to  seek  out  the  border  of  the  stability  regions.  Reasonably  good 

agreement  was  obtained  with  the  theoretical  result. 
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SECOND  ORDER  GREEDY  ALGORITHMS  FOR  CENTRALIZED  TELEPROCESSING 
NETWORK  DESIGN 

A-  Kershenbaum  and  R-  Boorstyn 
A.  Introduction 

The  problem  considered  is  that  of  finding  an  optimal  (minimum  cost)  design  for 
a centralized  telecommunication  network  given  a set  of  terminal  locations,  traffic 
magnitudes  between  these  locations,  and  a single  common  source  or  destination- 
(central  site)-  In  order  to  retain  simplicity  and  low  cost  in  the  terminal  hardware, 
such  networks  are  configured  as  trees  comprised  of  communication  facilities  of  a 
single  capacity-  Thus,  the  optimal  solution  to  this  problem  is  a capacitated  minimal 
spanning  tree  (CMST);  i- e- , a tree  of  minimum  total  length  satisfying  a constraint  or 
set  of  constraints  that  limit  the  total  traffic  and/ or  number  of  nodes  in  any  subtree 
rooted  at  the  central  site-  In  centralized  networks,  such  subtrees  are  called  multi- 
point lines-  This  is  a continuation  of  work  previously  reported  on- 

More  formally,  we  are  given  a set  of  locations  (nodes),  A = {a.|i=0,  l,...,n}  , 
where  aQ  is  the  central  site;  a symmetric  distance  measure,  D = {d„  |i,j*  0,  i , • • • , n} , 
giving  the  cost  between  any  pair  of  locations;  and  a constraint,  M,  on  the  total  num- 
ber of  nodes  or  traffic  in  a multipoint  line-  We  seek  a spanning  tree,  T,  rooted  at 
a , satisying  the  constraint,  and  of  minimum  total  length- 

This  formulation  is  quite  general-  All  we  require  of  the  cost  funtion  is  that  the 
cost  of  a link,  d^,  not  depend  upon  what  other  links  are  present  in  the  solution-  The 
constraint  is,  likewise,  quite  general-  One  can,  in  fact  have  more  than  one  constraint 
and  the  constraints  may  be  on  total  traffic,  number  of  nodes,  nodal  degree,  number 
of  links  in  cascade,  or  any  other  quantity  associated  with  the  design-  We  require  only 
that  if  some  subset,  S , of  A,  does  not  satisfy  the  constraints  then  no  other  subset, 

S'  , containing  S,  satisfies  the  constraints-  We  also  assume  that  a star  solution; 
i- e- , all  nodes  connected  directly  to  the  center,  is  feasible-  This  can  always  be  made 
so  by  splitting  a location  into  two  or  more  nodes- 

Several  heuristics  have  been  developed  for  the  solution  of  this  problem  and 
optimal  techniques  exist  as  well-  The  optimal  techniques*  ’ 2’  " are  branch  and  bound 
procedures  which,  in  general  have  running  times  which  are  exponential  in  the  number 
of  nodes-  While  they  are,  thus,  not  practical  as  design  procedures,  results  obtained 
using  them,  in  particular  a recently  developed  procedure,  7 have  led  to  some 
insight  into  the  refinement  of  heuristic  procedures-  The  procedure  described  below 
is  an  outgrowth  of  this  work- 

I 
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Currently,  the  most  widely  used  procedures  lor  the  solution  of  the  CMST 
problem  are  heuristics  which  produce  solutions  within  5%  of  the  optimum  (in 

cases  where  the  optimum  is  known)  and  which  have  running  times  which  are  a loworder 
polynomial,  generally  between  quadratic  and  cubic,  in  the  number  of  nodes- 
Karnaugh,  ^ has  developed  a family  of  Second  Order  Greedy  Algorithms  (SOGA's) 
which  iterate  the  above  heuristics  (which  he  refers  to  as  F (First)  OGA's,  and  have 
longer  running  times  but  produce  results  generally  2 to  3%  better  than  the  above 
heuristics-  The  procedures  described  below  are  variants  of  the  general  SOGA  pro- 
cedures described  by  Karnaugh  and  are  shown  to  be  considerably  faster  than  his; 
indeed,  in  many  cases  they  are  competitive  with  the  simpler  procedures  previously 
used- 

B-  Procedural  Description 

The  most  often  used  heuristic  solutions  to  the  CMST  problem  shiare  the  follow- 
ing properties:  (1)  their  running  time  is  a polynomial  (usually  of  order  between  2 and 
3);  (2)  in  the  absence  of  constraints  they  will  yield  a minimum  spanning  tree  (MSI);  and 
(3)  the  quality  of  the  solution  (i- e- , the  amount  by  which  it  differs  from  the  optimum) 
is  not  controllable  and,  except  very  loosely,  is  not  known- 

The  basic  heuristics  used  to  solve  CMST  problems  can  be  divided  into  two 
categories  - primal  procedures  which  seek  to  improve  a feasible  starting  solution  or 
partial  solution  and  dual  procedures  which  seek  to  make  a low  cost  (infeasible)  starting 
solution  feasible-  We  concentrate  on  primal  procedures,  having  found  them  to  be 

g 

generally  more  flexible-  It  has  been  shown  that  most  such  procedures  fall  within  the 
framework  of  the  following  scheme: 

(t)  Start  with  each  node  on  a separate  multipoint  line  directly  connected  to  the 
center-  Associate  a weight,  wj,  with  each  node,  i,  by  applying  a given 
rule  (w-rule)-  For  each  potential  link  Lj:  interconnecting  a pair  of  nodes, 
i and  j,  define  a trade-off  function,  ty,  as  dy-  w^- 

(2)  Consider  the  (not  previously  considered)  Ly,  for  which  ty  is  minimum- 
If  nodes  i and  j are  in  separate  multipoint  lines  and  merging  these  links 
does  not  violate  any  constraint,  then  add  Ly  to  the  network,  replacing 

or  Lj0,  whichever  is  more  costly-  If  not,  reject  Ly- 

(3)  Update  the  Wj  and  tj:  and  return  to  Step  2 until  no  further  gain  can  be 
obtained-  Usually,  wj=wj  for  nodes  i and  j in  the  same  multipoint  line- 

9 

The  implementation  of  such  a procedure  has  been  considered  in  detail-  A 
careful  implementation  is  shown  to  be  of  order  N^log2  N if  one  wishes  to  consider  all 
branches  and  of  order  NKlog^N  if  one  only  examines  the  K nearest  neighbors  of 
each  node  and  if  the  w-rule  itself  is  not  too  computationally  complex-  In  practice,  the 
most  widely  used  w-rule  is  to  set  w^  = djQ;  i- e- , the  weight  of  each  node  in  component 
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i is  set  to  be  the  minimum  distance  between  a node  in  the  component  and  the  center- 
This  is  known  as  the  Esau- Williams  algorithm-  Karnaugh^*  described  an  alternate 
implementation  of  the  Esau- Williams  procedure  which  has  a different  computational 
complexity,  generally  between  quadratic  and  cubic,  but  which  produces  identical 
results-  While  the  SOGA's  described  below  can  work  with  any  imbedded  FOGA,  for 
the  sake  of  comparison  with  previous  work  we  performed  experiments  with  an  im- 
bedded Esau- Williams  algorithm- 

In  order  to  implement  a SOGA,  one  must  decide  which  arcs  one  will  attempt  to 
force  in  or  out  of  the  solution-  Karnaugh  suggested  two  possibilities: 

(1)  Inhibit  - At  each  stage  in  its  execution,  the  FOGA  brings  in  one  arc  con- 
necting two  previously  unconnected  3ets  of  nodes-  The  Inhibit  loop  suc- 
cessively prevents  each  of  these  mergers  from  taking  place-  Thus,  one 
iteration  of  the  Inhibit  loop  involves  N (where  N is  the  number  of 
terminals)  iterations  of  the  FOGA  each  of  which  prevents  a single  pair  of 
node  clusters  from  merging-  At  the  end  of  each  iteration  of  the  Inhibit 
loop  the  best  of  the  N generated  solutions  is  kept  and  used  in  place  of  the 
original  (uninhibited)  FOGA  solution  and  the  Inhibit  loop  is  repeated  until  no 
further  progress  is  made- 

(2)  Join  - For  each  node  find  its  nearest  neighbor,  bi,  and  nearest 
neighbor  which  is  also  closer  to  the  center  than  aj,  cj  • Successively, 
one  arc  at  a time,  force  arcs  (a^  b^)  and  (aj,  cj)  into  the  solution  if  they 
are  not  already  present-  One  Join  loop  consists  of  a sequence  of  executions 
of  the  FOGA,  each  with  a single  forced  arc-  The  Join  loop  is  iterated, 
starting  from  the  best  solution  obtained  during  the  previous  loop,  until  no 
further  progress  is  made-  The  number  of  FOGA  iterations  in  a Join  loop 

is  bounded  from  above  by  2N-1  but  is,  on  the  average  somewhat  less  than 
N,  the  exact  number  being  a function  of  the  particular  problem- 

Thus,  both  the  Inhibit  and  Join  procedures  have  running  times  of  order  CN 
times  the  running  time  of  the  embedded  FOGA,  the  Join  procedure  being  somewhat 
faster-  The  factor  C is  the  number  of  iterations  required  for  convergence,  i-  e-  , 
to  reach  the  point  of  no  further  progress.  It  would  be  possible  to  greatly  improve  the 
running  time  of  the  SOGA  if  one  could  restrict  one's  attention  to  a small  (i-  e-,<  < N), 
subset  of  the  arcs-  Also  Karnaugh  mentioned  that  the  Join  procedure,  while  some- 
what faster  than  Inhibit,  is  weaker  because  it  restricts  itself  to  local  transformations 
involving  nearest  neighbors-  Using  the  techniques  in  Ref-  7 it  was  possible  to  obtain 
optimal  solutions  for  a class  of  interesting  problems  and  make  observations  about  char- 
acteristics of  optimal  solutions-  This  led  to  the  characterization  of  arc  subsets  which 
are  small  and  at  the  same  time  reasonably  effective  for  use  as  candidates  in  a SOGA.  For 
a CMST  problem  with  18  nodes  and  a constraint  on  the  size  of  any  subtree  of  m=3.  Figure 
la  is  the  solution  yielded  by  the  Esau- Williams  algorithm-  Figure  lb  is  the  unconstrain- 
ed MST  on  the  same  problem.  Figure  lc  is  the  optimal  solution-  Note  that  the  two  arcs 
present  in  the  optimal  solution  but  not  present  in  the  Esau  - Williams  solution  are  (G,0)and 
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(B,  K)  and  are  MST  arcs.  While  it  is  not  necessarily  so  that  all  arcs  present  in  the 
optimal  solution  and  not  in  the  Esau- Williams  solution  are  MST  arcs,  in  all  the  ex- 
periments run  and  checked  to  this  end,  invariably  at  least  one  arc  in  the  optimum  but( 
not  in  the  Esau- Williams  solution  was  an  MST  arc* 


,c  ESAU  — WILLIAMS^ 

OPTIMAL 


Fig.  1.  Comparison  between  Esau- Williams' 
and  optimal  solution* 

This  is  not  to  say  that  it  is  conjectured  that  one  must  always  be  able  to  find  an 
optimum  solution  which  contains  an  MST  arc  not  part  of  the  Esau- Williams  solution 
(when  the  Esau- Williams  solution  itself  is  not  optimal).  It  merely  strengthens  the 
point  that  MST  arcs  are  reasonable  arcs  to  examine  when  seeking  to  improve  a 
heuristic  solution. 

Even  without  this  evidence,  however,  it  is  intuitively  appealing  to  consider  such 

arcs*  Most  primal  heuristics  share  the  property  that  they  proceed  by  connecting 

8 

nodes  to  their  nearest  feasible  neighbors*  The  algorithms  differ  only  in  the  order  in 
which  they  consider  the  nodes*  Thus,  the  solutions  differ  only  in  that  the  distance  to 
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a node's  nearest  feasible  neighbor  may  differ  depending  upon  when  the  node  is  con- 
sidered during  the  execution  of  the  algorithm.  If  such  a heuristic  fails  to  find  the 
optimum,  it  will  be  because  the  algorithm  erred  by  waiting  too  long  before  considera- 
tion of  some  critical  node  (or  nodes)  and  hence  missed  being  able  to  include  some 
desirable  arc  (or  arcs)  in  the  solution*  It  is  quite  likely  that  one  or  more  of  these 
arcs  will  be  MST  arcs;  indeed,  as  mentioned  above,  this  was  always  the  case  with  the 
experiments  carried  out* 

Thus,  a heuristic  is  proposed  which  attempts  to  improve  solutions  generated  by 
some  primal  heuristic  by  forcing  the  inclusion  of  one  or  more  MST  arcs  which  the 
primal  heuristic  left  out*  This  gives  rise  to  the  following  procedure: 

Step  1:  Generate  a MST,  Tm  = (a^  | j = l,  • • • , n-1 }.  Generate  a feasible  tree. 

^f  = . n'0.  Find  S ■ Tm-  T^^T^)  • (For  problems  of  interest,  Sis 

nonempty.  ) 

Step  2:  For  each  subset  SjCS:  set  S ^ = S-Sj*  Remove  all  a e S^  from 
the  network.  Start  a solution  by  including  all  a.  c Sj  • Apply  the  primal  heuristic 
to  complete  the  generation  of  a solution. 

It  should  be  noted  that  an  alternative  procedure  would  be  to  include  a.  in  S,  but 

J 1 

not  to  exclude  b.  in  S-*  It  was  felt,  however,  that  if  elements  in  S,  were  retained  as 
J “ “ 

candidates  for  includion,  duplicate  solutions  would  be  likely  to  result.  The  exclusion 

of  elements  in  S^  guarantees  unique  solutions  and  hence  should  increase  the  likelihood 

of  generating  improvements. 

C*  Computational  Experience 

In  order  to  compare  the  effectiveness  and  efficiency  of  the  procedural  scheme 
described  in  the  previous  section,  a series  of  experiments  was  run.  Nodes  were 
generated  with  random  locations  over  a unit  square  and  unit  traffic;  problems  with 
N * 40,  60,  90,  and  120  nodes  were  generated  and  constraints  varying  between  N/20 
and  N/4  were  examined* 

Alternatively,  a constraint  on  traffic  could  have  been  used  but  we  did  not  feel 
that  significantly  different  results  would  have  been  obtained*  Our  first  objective  was 
to  examine  the  running  time  of  our  implementation  of  the  Esau- Williams  Algorithm 

Q 

within  the  context  of  the  Unified  Algorithm. 

Karnaugh^  described  an  alternate  Implementation  of  the  Esau- Williams  pro- 
cedure and  reports  on  its  running  time  on  an  IBM  370/ 158.  These  times  are  shown 
in  Table  I together  with  those  obtained  using  the  above  implementation  of  the  Esau- 
Williams  procedure  on  a set  of  problems  similar  to  those  reported  on  in  Karnaugh's 
paper;  the  runs  were  made  on  a DEC  KL20/50  processor,  which  is  somewhat  slower 


396 


COMPUTERS  AND  COMPU TER-COMMUNICATION  NETWORKS 


than  a 370/  158.  As  can  be  seen,  there  is  a considerable  difference  between  the  run- 
ning times  of  the  two  procedures,  the  one  described  here  being  faster*  While  it  is 
somewhat  difficult  to  compare  the  running  times  of  the  procedures  based  on  such  a 
small  number  of  sample  runs,  especially  since  the  computers  and  compilers  are  dif- 
ferent, the  difference  in  running  times  is  great  enough  to  warrant  the  conclusion  that 
the  Unified  Algorithm  is  faster  and  of  a lower  order  of  complexity* 

Table  I*  Esau- Williams  Algorithm 

Runtime  - (Initial  execution) 


Number  of  Nodes 

Runtime  (seconds) 

Karnaugh 

Unified 

40 

3*  03 

.51 

60 

3*  25 

1*  1 1 

90 

4*  17 

2*  63 

120 

5*  64 

4*  67 

In  general,  the  second  and  subsequent  calls  to  the  FOGA  do  not  require  as  much 
running  time  as  the  initial  call  since  the  initialization  of  the  problem,  i*  e* , finding 
nearest  neighbors  and  setting  up  the  trade-off  functions,  does  not  have  to  be  repeated* 
Also,  one  can  start  later  FOGA  iterations  with  a partial  solution  obtained  in  the  pre- 

iL 

vious  call*  Specifically,  if  one  is  up  to  the  K- — call  to  the  FOGA  within  a loop,  one 
can  bring  in  the  same  arcs  brought  in  on  the  previous  FOGA  call  until  th-j  new  forced 
inclusion  or  exclusion  induces  a change  in  the  solution.  Table  II  (in  a manner  similar 
to  Table  I)  shows  a comparison  of  the  running  times  for  the  second  and  laier  calls  to 
the  Esau  - Williams  procedure  implemented  both  ways*  Again  we  see  the  Unified 
Algorithm  is  faster* 

Table  II*  Esau- Williams  Algorithm 

Runtime  - (Eater  Executions) 


Number  of  Nodes 

Runtime  (seconds) 

Karnaugh 

Unified 

40 

* 07 

* 03 

60 

* 1 6 

* 06 

90 

* 32 

• 12 

120 

* 57 

• 19 
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Next,  we  examined  the  size  of  the  subsets,  S,  generated,  i*  e* , the  average 
number  of  MST  links  not  present  in  the  FOGA  solution*  Most  primal  heuristics 
generate  subtrees  which  are  MST's  on  the  set  of  nodes  they  contain  along  with  the 
center*  Most  of  the  arcs  in  these  subtrees  (with  the  exception  of  the  arc  directly 
connected  to  the  center)  will  be  MST  arcs*  Thus,  one  might  expect  that  | S | ~ ^ , and 
a vast  improvement  can  be  made  over  a blind  branch  exchange  or  SOGA  procedure* 
However,  this  turns  out  to  be  slightly  conservative*  In  practice,  for  n_^  20  and  v 
in  the  geographic  center,  and  for  n <_  1 5 and  vq  in  the  corner,  )s|«^  • For  larger 
networks,  |s|  ranges  between  ^ and  n n , where  |s|  for  networks  with  non - 
centered  roots  is  generally  larger  than  for  the  same  network  with  v in  the  center* 

I S I 

There  are,  in  fact,  21  subsets  of  S*  Thus,  for  moderately  large  tightly  con- 
strained proglems,  S could  grow  large  enough  to  make  evaluating  all  subsets  im- 
practical. Furthermore,  it  is  reasonable  to  assume  that  not  all  arcs  in  S interact 
with  one  another;  i*  e*  , improvements  in  separate  parts  of  the  network  can  be  found 
and  justified  independently  of  one  another*  Further,  in  running  the  experiments,  it 
was  found  that  if  a group  of  arcs  yielded  an  improvement  when  forced  into  the  solution, 
subsets  of  these  arcs  often  yielded  improvements  too*  Thus,  the  heuristic  was 
modified  by  only  considering  subsets  S.  C S such  that  | S.  | K for  some  given  K* 

The  best  subset,  S.  , is  found  and  permanently  forced  into  the  solution*  We  then  set 
* 1 

S = S - Sj  and  repeat  the  procedure  until  no  further  improvement  can  be  made*  As 
multiple  branch  exchanges  are  often  required  to  effect  improvements,  the  value  of  K 
chosen  should  be  large  enough  to  ensure  that  most  advantageous  exchanges  will  be 
found  and  at  the  same  time  small  enough  to  keep  the  procedure  computationally  tract- 
able; K=  2 seemed  to  work  well*  Experiments  were  run  with  larger  values  of  K;  only 
in  isolated  cases  was  any  improvement  over  K = 2 obtained;  in  no  case  was  an 
improvement  in  excess  of  1 7o  observed*  Thus,  it  was  decided  in  all  the  remaining 
experiments  to  use  the  extended  procedure  (which  proceeds  from  the  best  solution 
found)  and  restrict  the  examination  to  subsets  of  cardinality  less  than  or  equal  to  two* 

Using  this  modification  of  the  new  heuristic  on  a large  number  of  problems,  it 
was  observed  that  in  general,  forcing  arcs  between  nodes  which  are  close  to  the 
center  seemed  to  have  the  greatest  effect  on  the  value  of  the  solution*  This  was 
probably  a consequence  of  the  fact  that  the  Esau- Williams  algorithm  starts  with  nodes 
far  from  the  center  and  hence,  dealing  with  nodes  near  the  center  first  radically 
changes  the  solution  value* 

Large  subsets,  in  some  cases  all  of  S,  sometimes  yielded  the  best  solutions* 
This  means  that  one  cannot  simply  consider  subsets  of  limited  size  without  iterating 
to  subsequently  force  additional  arcs  into  the  solution* 


398 


COMPUTERS  AND  COMPUTER-COMMUNICATION  NETWORKS 


In  particular,  it  was  found  that  for  small  networks  (n  < 20)  with  centered  v , 
there  was  little  difference  (usually  < 1%)  in  performance  between  the  new  heuristic 
and  the  Esau- Williams  solution*  This  is  almost  certainly  because  both  procedures 
were  generating  near-optimal  solution.  However,  for  larger  networks  (up  to  n = 100) 
and  centered  vq  , particularly  for  tightly  constrained  problems,  the  new  heuristic 
performed  noticeably  better  with  improvements  averaging  about  1.5%.  Figure  2 
shows  the  relative  improvement  obtained  using  the  new  heuristic  relative  to  the 
Esau- Williams  algorithm-  Each  point  on  the  curve  represents  an  average  over  12 
sample  runs.  As  can  be  seen,  the  improvement  increases  as  the  problem  size  does- 
Significant  savings  are  obtainable  for  realistic  problems. 
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Surprisingly,  the  improvement  was  only  weakly  correlated  with  problem  tight- 
ness (ratio  of  m to  n);  the  new  heuristic  achieved  slightly  larger  savings  in  more 
tightly  constrained  problems*  Related  to  this,  is  that  substantial  savings  were 
achieved  even  for  very  loosely  constrained  problems*  In  such  cases,  the  constrained 
solutions  differed  only  slightly  from  the  MST,  as  expected* 

Results  were  even  more  encouraging  for  large  networks  with  vq  in  the  corner, 

such  networks  may  be  viewed  as  one -fourth  of  a network  of  4n  nodes  with  v in  the 

o 

center*  For  such  problems,  improvements  averaging  4%  and  as  high  as  8%  were 
observed*  Thus,  the  procedure  appears  to  be  of  value  for  many  realistically-sized 
problems  with  tight  constraints* 


A straightforward  implementation  of  the  Esau  - Williams  procedure  has  a com- 

2 9 

putational  complexity  of  order  n log^,  n"  ^ more  careful  implementation  can  re- 
duce the  complexity  to  order  n log^  n*  As  discussed  previously,  S was  found  to 
range  between  — and  n n • Since  we  examine  subsets  of  cardinality  at  most 


2,  subsets  are  examined  on  each  iteration.  At  worst,  successive  s 

of  cardinality  could  be  introduced  and  thus,  the  procedure  could  iterate  at  most 
times.  In  practice,  the  number  of  iterations  grows  more  slowly  than  |s|* 


successive  subsets 

.....  s| 


Thus,  a careful  implementation  of  the  procedure  has  a complexity  of  |s|  nlogn* 
If,  for  instance,  | S | = n ^ the  procedure's  complexity  is  x?  ^ log  n* 


Finally,  a version  of  the  Inhibit  and  Join  procedures  (as  described  in  Ref*  6 
but  using  the  Unified  Algorithm  in  place  of  the  alternate  implementation  of  Esau- 
Williams)  were  coded  up  and  run  on  another  set  of  problems  and  the  heuristic 
described  in  this  paper  was  then  run  on  the  same  set  of  problems*  Thus,  we  were 
able  to  directly  compare  the  quality  of  the  obtained  solutions  as  well  as  the  running 
times  of  the  new  heuristic  with  Inhibit  and  Join*  The  running  times  are  summarized 
in  Figure  3*  As  can  be  seen,  the  new  heuristic  is  much  faster  than  Inhibit,  and  faster 
than  Join*  Indeed,  it  is  only  2-3  times  slower  than  the  Unified  Algorithm. 


The  quality  of  the  solutions  obtained  varied*  In  some  cases  the  new  heuristic 
outperformed  Inhibit;  in  others  Inhibit  performed  better*  On  the  whole,  the  quality  of 
the  solutions  obtained  with  Inhibit  were  2*  6%  better  than  those  obtained  using  the  Esau- 
Williams  procedure,  while  those  obtained  using  the  new  heuristic  were  1*  9%  better 
than  Esau- Williams*  Both  Inhibit  and  the  new  heuristic  out  performed  Join  fairly 
consistently;  Join  averaged  a 1*5%  improvement  over  Esau- Williams* 
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Fig-  3-  Comparison  of  runtimes- 

D-  Conclusion 

We  found  the  new  heuristic  to  be  of  definite  value  as  it  obtained  solutions 
roughly  2%  better  than  the  Esau- Williams  procedure  without  greatly  increasing  the 
running  time-  In  comparison  with  Karnough’s  SOGA's  the  new  heuristic  is  much 
faster  and  obtains  solutions  somewhat  better  than  Join  and  somewhat  worse  than 
Inhibit.  We  conclude  that  the  new  heuristic  is  useful  as  a practical  design  procedure 
even  when  imbedded  in  a larger  procedure  which  solves  more  global  problems- 

A-  Kershenbaum  and  R.  BoOrstyn 
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ADAPTIVE  ROUTING  IN  NETWORKS 
R.R.  Boorstyn  and  A.  Livne 


Most  adaptive  routing  schemes  perform  about  as  well  as  nonadaptive  schemes, 
when  evaluated  in  a fixed  environment.*  However,  they  do  adapt  to  changes  in  network 
topology  and  input  statistics.  The  reason  they  do  not  show  a significant  improvement 
is  that  they  are  actually  quasi- static,  in  that  they  sense  and  respond  to  the  above  changes 
slowly.  Their  goal  is  to  select  good  paths.2 

We  focus  our  attention  on  the  node,  which  may  be  viewed  as  a multiple  server 
queuing  system.  Most  adaptive  routing  schemes  operate  the  node  as  a collection  of 
single  server  queues.  If  the  node  is  operated  as  a multiple  server  queue,  an  advantage 
at  the  node  of  a factor  equal  to  as  much  as  the  number  of  servers  (outgoing  branches) 
can  be  obtained.  However,  the  control  over  good  paths  may  be  lost.  We  show  that  we 
can  simultaneously  obtain  both  good  paths  and  improved  node  performance.  Further- 
more, the  approach  lends  itself  to  analysis.  We  also  consider  limits  and  extensions. 

Many  computer  communication  networks  use  dynamic  routing  schemes  to  compen- 
sate for  input  traffic  variations,  to  respond  to  changes  in  topology,  and  to  take  advan- 
tage of  temporary  changes  in  loading  in  different  paths.  These'adaptive  schemes  are 
complex  and  are  usually  chosen  and  verified  by  extensive  simulations.  Invariably  dur- 
ing the  design  of  a network  they  are  replaced  by  analytically  tractable  non-dynamic 
(static)  schemes.  We  present  here  a dynamic  routing  scheme  for  which  we  have  been 
able  to  derive  approximate  analytical  models.  Furthermore,  we  can  establish  the  ef- 
ficiency of  this  scheme,  especially  in  heavily  loaded  situations. 

A typical  static  routing  scheme  would  operate  as  follows.  Consider  as  separate 
commodities  the  messages  originating  at  a particular  node  and  destined  for  a second 
node  in  the  network.  There  are  usually  several  good  paths  connecting  these  nodes. 

The  static  routing  scheme  would  specify  the  optimum  proportion  of  traffic  to  be  routed 
over  each  path.  Efficient  algorithms  exist  for  design  of  this  type  of  routing.3 

We  can  identify  one  particular  problem  with  this  approach.  Although  good  paths 
are  indeed  found,  any  node  essentially  operates  as  a collection  of  single  server  queues -- 
one  queue  for  each  outgoing  branch.  Considering  the  node  as  a queue  with  several  po- 
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retain  the  good  paths  for  commodities  and  yet  still  get  the  benefit  of  the  faster  perform- 
ance at  the  node. 

4 

Briefly  our  scheme  is  as  follows.  Consider  a node  as  a single  queue  with  several 
servers  (output  channels).  For  a particular  commodity,  i.e.,  a message  with  a certain 
destination,  the  use  of  some  of  these  servers  would  cause  the  messages  to  be  sent  along 
"bad"  paths  --  either  too  long  or  too  heavily  loaded.  Thus  for  each  commodity  and  at 
each  node  we  specify  a subset  of  the  output  channels  as  allowable  and  permit  the  mes- 
sage to  use  any  allowable  channel  according  to  some  discipline.  Each  commodity  ap- 
pearing at  the  node  has  its  own  set  of  allowable  channels.  These  restrictions  force 
messages  to  use  "good"  paths. 

The  assignment  of  allowable  branches  at  each  node  for  each  commodity  is  one 
level  of  our  adaptive  routing  scheme.  These  assignments  are  based  on  essentially  glo- 
bal information  of  topology,  flows,  and  long  term  averages,  and  may  be  adaptive  in  a 
quasi-static  way.  The  second,  and  truly  dynamic,  level  of  our  routing  strategy  is  local 
and  involves  the  queue  discipline  at  each  node.  As  an  example,  consider  a node  with 
two  outgoing  channels  (servers).  All  message  commodities  fall  into  three  classes. 

Two  of  these  must  use  only  one  of  the  servers  and  have  no  choice.  Messages  join  the 
appropriate  queue  and  are  served  in  turn.  The  third  class  of  messages  may  use  either 
of  the  two  servers  and  join  a third  queue. 

The  best  strategy  is  to  give  priority  to  the  messages  that  have  no  choice,  i.e., 
are  dedicated  to  one  of  the  servers.  Another  strategy  is  to  allow  messages  in  the  third 
category  (non-dedicated)  to  join  the  shorter  of  the  two  dedicated  queues.  We  have  eval- 
uated the  performance  of  both  strategies,  and  other  similar  strategies,  for  this  simple 
two  server  model,  and  for  more  complex  models.  The  measure  of  performance  was 
the  average  time  delay  for  all  messages  traversing  the  node. 

Although  individual  commodities  perform  very  differently,  the  average  delay  for 
all  is  fairly  insensitive  to  the  different  disciplines  we  have  investigated.  (Of  course, 
other  less  appropriate  disciplines  will  behave  poorly.)  Essentially  so  long  as  a rela- 
tively modest  amount  of  the  traffic  has  choice,  and  for  a moderately  high  utilization, 
the  performance  of  the  node  is  close  to  that  of  a multiple  server  queue.  This  has  been 
observed  for  a wide  variety  of  combinations  of  servers,  commodities,  and  flows. 

It  has  been  shown  that  in  the  limiting  case  of  heavy  traffic,  using  diffusion  techni- 
ques, that  a join- the- shortest  queue  discipline  converges  to  the  performance  of  a mul- 
tiple server  queue  so  long  as  there  is  some  traffic  that  has  choice  that  spans,  in  some 

5 6 

combination,  all  the  servers.  ’ This  property  is  also  t ue  for  a wide  variety  of  con- 
figurations . 
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We  have  evaluated  this  dynamic  routing  strategy  using  analytic  techniques  for  sim- 
ple configurations,  approximate  techniques  for  more  complex  models,  simulation,  nu- 
merical analysis,  and  limiting  arguments  in  heavy  traffic.  All  results  reflect  a robust- 
ness in  performance  as  described. 

In  general,  we  have  found,  that  the  node  retains  its  performance  advantage  as  a 
multiple  server  queue  as  long  as  a modest  amount  of  the  traffic  has  choice.  We  have 
developed  approximate  and  fairly  accurate  analytical  models  to  calculate  the  perform- 
ance of  these  nodes. 

We  have  also  developed  analytical  methods  to  imbed  these  nodes  in  a network  and 
to  calculate  the  overall  performance  of  the  network.  Basically,  we  find  that  if  the 
average  number  of  output  channels  per  node  is  k,  then  the  time  delay  for  messages  in 
a heavily  loaded  network  can  be  reduced  by  as  much  as  a factor  of  k when  this  dynamic 
routing  is  used. 

We  have  been  studying  an  improvement  in  the  routing  strategy  by  allowing  the 
service  discipline  to  depend  not  only  upon  queue  lengths  at  the  local  node  but  on  the 
status  of  neighboring  nodes.  Simple  canonical  models  exhibiting  this  behavior  can  be 
analyzed  in  the  heavy  traffic  limit  using  diffusion  techniques. 

This  report  is  an  updated  version  of  a previous  report  and  was  presented  at  the 
AFOSR  Workshop  on  Communications  Systems  and  Applications,  Provinceton,  Mass., 
September  1978. 
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MODELING  OF  DIGITAL  SYSTEMS  USING  MICRCPRCCESSCRS 
D .R.  Kaufman  and  E.  J Smith 

A.  Introduction 

Simulators  for  purposes  of  logic  and  design  verification  play  a major  role  in  any 
design  automation  system.  This  report  presents  a logic  simulator  consisting  of  a net- 
work of  microprocessors  modeling  the  subunits  of  a digital  system.  The  simulator  is 
highly  interactive,  has  a fast  response  time,  and  is  capable  of  modeling  a large  digital 
system.  An  analysis  of  the  advantages  and  disadvantages  of  the  two  commonly  used 
methods,  breadboarding  and  software  simulation,  is  presented.  To  overcome  some  of 
their  limitations  the  new  modeling  system  is  proposed. 

Logic  verification  through  breadboarding  is  used  for  small  and  medium  size  sys- 
tems. The  prototype  provides  an  accurate  model  of  the  logic  but  not  of  the  delay 
characteristics  of  the  device,  since  the  former  is  usually  built  using  different  technol- 
ogy components.  One  of  the  advantages  of  a prototype  is  that  it  can  be  connected  to 
other  systems;  thus  interface  problems  can  be  detected.  When  a subunit  is  finally  built 
the  corresponding  hardware  from  the  model  can  be  replaced  and  thus  the  prototype  can 
act  as  a testing  tool. 

Breadboarding  allows  for  high  interaction  between  the  designer  and  the  prototype. 
The  debugging  tools  (oscilloscopes,  logic  analyzers)  are  familiar  to  the  designer  and 
thus  easy  to  use.  One  of  the  disadvantages  of  breadboarding  is  initialization  of  the  logic 
subunit,  since  the  prototype  can  power-up  in  different  states  and  thus  it  is  difficult  to 
know  whether  the  input  stimuli  have  deterministically  driven  the  prototype  to  a specified 
state.  For  any  large  digital  system  it  becomes  very  difficult  and  expensive  to  build  a 
model,  since  the  components  used  usually  contain  a small  number  of  gates.  Another 
drawback  is  that  the  prototype  hardware  cannot  be  used  in  a different  project. 

Software  simulation  makes  use  of  an  available  general  logic  simulator  that  pro- 
vides the  user  with  a large  number  of  commands.  The  program  accurately  models  both 
the  logic  and  timing  characteristics  of  the  device.  There  is  no  initialization  problem 
since  every  gate  is  assumed  to  be  in  an  undefined  state  at  the  beginning.  The  simulator 
is  typically  a large  software  system  shared  by  more  than  one  user.  Also,  due  to  the 
general  nature  of  the  program  there  is  a great  deal  of  overhead  processing  not  neces- 
sary for  simple  logic  verification.  The  drawbacks  of  software  logic  verification  are 
poor  turn-around  times,  especially  for  large  digital  systems,  and  low  interaction  be- 
tween the  user  and  the  simulator. 


The  simulator  that  will  be  considered  in  the  present  work  was  designed  to  over- 
come some  of  the  limitations  for  the  other  modeling  systems.  The  system  of  an  array 
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of  microprocessors  operating  under  centralized  control.  Each  microprocessor  card 
models  a subunit  of  the  system  to  be  simulated.  Modeling  of  the  overall  is  done  by 
software  using  a standard  simulation  algorithm  for  logic  verification.  The  input  and 
output  pins  of  the  card  are  functionally  identical  to  the  subunit  pins.  By  wiring  the 
cards  together  we  obtain  the  model  of  the  whole  system.  The  system  is  very  flexible 
since  it  allows  for  mixed  models  to  be  used,  i.e.  , some  cards  can  contain  register- 
transfer- level  descriptions  while  others  can  contain  gate-level  descriptions.  The  pos- 
sibility exists  for  a microprocessor  card  to  be  replaced  by  "real"  hardware  thus  allow- 
ing the  system  to  be  used  as  a testing  tool  also.  This  approach  has  all  the  advantages 
of  breadboarding  and  solves  most  of  the  problems.  Initialization  of  the  logic  circuit  is 
no  longer  a drawback  since  simulation  is  done  in  software.  Building  a system  model  is 
no  longer  a major  effort  since  the  tasks  to  be  done  are  to  provide  the  microprocessor 
card  with  a description  of  the  logic  circuit  and  to  wire  the  cards  together.  One  major 
advantage  of  the  system  is  that  the  hardware  is  reusable  in  other  projects.  The  sys- 
tem can  be  connected  to  existing  digital  systems,  but  with  some  limitations  due  to  the 
fact  that  the  microprocessor  array  is  slower  than  a hardware  prototype.  The  system 
has  advantages  over  software  simulation  in  that  it  is  highly  interactive  and  the  response 

m 

time  is  much  better  because  each  subunit  is  simulated  by  a separate  microprocessor. 

A data  structure  for  representing  a logic  net  and  an  algorithm  for  nominal  delay  simu- 
lation will  be  presented  in  a form  suited  for  implementation  on  a microprocessor.  We 
note  here  that  the  main  objectives  are  to  provide  an  accurate  model,  fast  response  to 
input  stimuli,  and  capability  to  handle  large  digital  units.  The  architecture  of  such  a 
system  will  be  presented  and  the  basic  components  described  in  detail.  Attention  will 
be  paid  to  minimizing  the  amount  of  logic  necessary  to  control  the  array  of  microproces- 
sors and  basic  operating  procedures  will  be  discussed. 

B.  Simulation  Data  Base 

In  this  section  a data  structure  for  the  representation  of  logic  circuits  is  presented. 
We  start  by  defining  some  of  the  terms  used  and  the  features  of  the  simulator.  One  of 
the  goals  in  determining  the  format  of  the  data  base  is  to  provide  the  minimum  amount 
of  information  to  the  model  unit  in  order  to  save  memory  and  model  set-up  effort.  One 
of  the  basic  assumptions  is  that  an  8-bit  microprocessor  will  be  used  in  the  model  unit 
to  simulate  the  logic  net. 

Each  model  unit  has  a set  of  inputs  functionally  equivalent  to  the  inputs  of  the  sub- 
unit being  simulated.  These  inputs  will  be  called  primary  inputs  (PI)  and  similarly  the 
outputs  will  be  called  primary  outputs  (PO). 

For  each  gate  (sometimes  referred  to  as  node)  the  following  information  is  needed: 
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the  logic  function  (FCN),  the  propagation  delay  (DLY),  the  set  of  inputs  (INP),  and  the 
gate  output  (CUT).  In  order  to  describe  the  logic  unit  we  need  a table  that  gives  the 
fan-out  list  of  gates  (sometimes  referred  to  as  sinks  (SNK))  for  each  node.  The  list 
also  indicates  the  input  pin  (PIN)  fed  by  the  node.  In  order  to  speed  up  the  retrieval  of 
a node's  sinks  we  associate  with  each  node  a pointer  (PNT)  that  indicates  the  beginning 
of  the  fan-out  list  in  the  sink  table.  The  end  of  the  record  is  indicated  by  a bit  which  is 
set  to  1 to  indicate  the  last  entry. 

The  information  needed  by  the  simulator  is  divided  into  two  groups:  static  and 
dynamic.  The  static  information  is  not  altered  during  simulation  and  can  therefore  be 
stored  in  Read-Only  Memory.  Also,  the  static  information  is  the  minimum  required 
to  represent  a logic  network  all  other  information  being  derived  from  it.  The  static 
information  consists  of  logic  function,  propagation  delay,  and  fan-out  list.  The  dynamic 
information  is  stored  in  Random  Access  Memory  and  is  derived  by  the  microprocessor 
from  the  static  tables  during  initialization  or  altered  during  simulation. 

Besides  the  variables  discussed  so  far,  we  need  two  flags  which  are  used  by  the 
simulator.  They  are:  the  initialization  flag  (INI)  used  to  initialize  gate  inputs,  and  a 
storage  flag  (STO)  used  during  simulation.  The  use  of  these  flags  will  be  explained  in 
the  next  section  describing  the  simulation  algorithm.  In  order  to  show  an  example  of 
coding  of  all  the  above  variables  we  must  define  the  features  of  the  simulator. 

The  model  unit  can  simulate  a logic  net  of  up  to  2,000  gate  elements.  The  range 
of  propagation  delays  is  from  0 (for  wired  logic)  to  a maximum  of  15  units.  The  num- 
ber of  logic  functions  that  can  be  simulated  is  15  and  each  function  is  represented  by  an 
integer  in  the  range  of  0 to  14.  For  example:  PI(0),  PO(l),  AND  (2),  OR  (3),  •••,  etc. 
A gate  is  limited  to  a fan-in  of  16  inputs  while  the  fan-out  is  unlimited.  Figure  1 shows 
the  format  of  the  simulation  data  base. 

The  next  data  structure  to  be  discussed  is  the  implementation  of  the  time-flow 
mechanism.  The  time  element  will  be  represented  by  an  array  of  queues  where  each 
queue  corresponds  to  a time  unit.  The  number  of  entries  in  the  queue-array  is  given 
by  the  maximum  propagation  delay.  Each  queue  stores  the  nodes  that  were  scheduled 
to  be  processed  at  that  point  in  time.  The  current  queue  to  be  processed  is  indicated 
by  a clock  variable  (CLK)  which  is  incremented  modulo  (d  +1)  where  d is  the  maximum 
gate  propagation  delay.  Since  each  queue  has  a variable  length  it  is  necessary  to 
employ  some  memory  management  technique.  We  chose  to  use  dynamic  memory  allo- 
cation and  implement  each  queue  by  a linked  list.  Initially  all  the  memory  allocated 
for  the  time- queues  is  linked  together  in  a list  called  the  available  list  and  is  pointed 
to  be  the  available  pointer  (AVL).  In  order  to  operate  on  the  queue  (insert  and  delete) 


408 


COMPUTERS  AND  COMPUTER- COMMUNICATION  NETWORKS 


we  need  a start- of- list  pointer  (STR)  and  an  end-of-list  pointer  (END).  These  pointers 
are  stored  in  the  time  array. 


SINK/ PIN  LIST  (ROM) 


POINTER /FLAGS  LIST  (RAM) 


Fig.  1.  Simulation  data  base  format. 

When  processing  the  nodes  in  a queue  the  simulator  needs  a stack  to  temporarily 
store  the  nodes'  sinks.  The  stack  is  operated  on  through  the  use  of  a stack  pointer 
(STK)  for  deleting  and  inserting  elements.  The  reason  behind  the  use  of  a stack  and 
the  stack  operations  will  be  described  in  the  next  section.  Figure  2 shows  the  imple- 
mentation of  the  time-queues  and  of  the  stack. 

The  memory  requirements  for  the  data  base  are  as  follows:  Using  an  average 
number  of  sinks  = 2 we  need  8K  for  the  sink  table.  We  allocate  8K  for  the  time  queues 
and  the  stack.  The  rest  of  the  requirements  are  obvious  from  the  formats  of  the 
tables.  Summarizing,  we  need  10K  of  ROM  and  16K  of  RAM  to  handle  2,000  logic  gates. 
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Fig.  2.  Time-flow  mechanism. 

C.  Nominal  Delay  Algorithm 

As  mentioned  previously,  one  of  the  goals  of  this  simulator  is  to  achieve  fast 
response  time  to  input  stimuli.  We  use  a table-driven  simulator  together  with  selective 
trace  and  next-event  time-flow  mechanism  to  meet  the  speed  requirements.  The  simu- 
lation algorithm  is  presented  in  this  section  and  the  major  steps  are  discussed  in  de- 
tail. The  steps  of  the  algorithm  are  listed  below. 

(1)  if  queue  pointed  by  clock  is  not  empty  then  go  to  (3). 

(2)  clock  = clock  + 1;  go  to  (1). 

(3)  get  next  node  from  queue  pointed  by  clock. 

(4)  update  node  output. 

(5)  get  next  node  sink  from  fan-out  list. 

(6)  propagate  node  output  to  sink  inputs. 

(7)  if  sink  is  stored  in  stack  (STO  = l)  then  go  to  (9). 

(8)  store  sink  in  stack  (STC  = 1). 

(9)  if  not  the  last  sink  go  to  (5). 

(10)  if  not  the  last  node  in  queue  go  to  (3). 

(11)  get  next  sink  from  stack  (STC=0). 

(12)  calculate  new  output  of  sink. 

(13)  if  new  output  7=old  output  then  schedule  sink  in  queue  pointed  by  clock  + sink 
delay . 

(14)  if  stack  is  not  empty  then  go  to  (11). 

(15)  go  to  (1). 
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The  algorithm  starts  by  looking  for  the  first  non-empty  queue  (steps  1,2).  When 
a non-empty  queue  is  found  the  outputs  of  the  nodes  stored  in  the  queue  must  be  propa- 
gated throughout  the  logic  network.  A node  is  retrieved  from  the  current  queue  by  using 
the  start  of  list  pointer  in  the  time-array  (step  3).  For  each  node  scheduled  to  be 
processed  we  must  do  the  following  operations.  First,  we  note  that  the  node  output  has 
changed  and  we  record  the  change  by  negating  the  old  output  value  (step  4).  Second, 
we  must  propagate  the  new  node  output  to  the  inputs  of  the  sinks  listed  in  the  fan-out 
table.  The  latter  operation  is  accomplished  as  follows.  Each  node  has  a pointer  as- 
sociated with  the  node  fan-out  list  in  the  sink  table.  Using  the  pointer  the  sinks  are 
retrieved  one  by  one  (step  5).  Each  sink  is  stored  together  with  a pin  number  corres- 
ponding to  the  sink  input  fed  by  the  node.  The  value  of  this  pin,  stored  in  the  input 
table  is  negated  indicating  that  the  sink  had  a change  in  inputs.  Through  this  process 
the  node  is  propagated  to  the  sink  inputs  (step  6).  Next  we  check  if  the  sink  is  already 
stored  in  the  stack  by  looking  at  value  of  the  storage  flag  (step  7).  If  STO=0,  sink  is 
not  in  the  stack,  then  the  sink  is  pushed  on  top  of  the  stack  so  that  later  on  we  can  eval- 
uate its  output.  It  is  possible  that  more  than  one  node  in  the  current  queue  feeds  the 
same  sink.  We  must  first  determine  all  the  input  changes  for  a sink  before  we  can  cal- 
culate its  output  and  determine  whether  and  where  the  sink  should  be  scheduled.  There- 
fore we  must  gather  all  the  sinks  that  had  changes  in  inputs  due  to  changes  in  current 
nodes'  outputs  making  sure  that  no  sink  is  stored  more  than  one.  To  accomplish  this 
operation  we  make  use  of  a stack  (step  8).  If  we  reached  the  end  of  the  fan-out  list  for 
the  current  node  then  if  the  current  node  is  the  last  one  in  the  current  queue  then  we 
finished  propagating  the  changes  in  the  nodes'  outputs  to  their  sink  inputs  (steps  9,  10). 

At  this  point  we  release  the  storage  used  by  the  current  queue  and  attach  it  to  the  avail- 
able list  for  use  in  the  sink  scheduling  process.  For  each  sink  stored  in  the  stack 
(step  11)  we  perform  the  following  operations.  First,  we  calculate  the  new  output  of 
the  sink.  This  is  done  by  retrieving  the  function  code  and  branching  to  the  appropriate 
output  calculation  routine.  The  output  of  a primitive  function  such  as:  AND,  OR, 

NAND,  NOR  can  be  calculated  with  just  a few  instructions  as  follows.  We  examined  all 
the  input  pins  simultaneously  and  observe  that  for  example  the  output  of  an  AND  gate  is 
1 if  all  the  inputs  are  l's  and  <$  otherwise,  similarly  the  output  of  an  OR  gate  is  ($  if  all 
the  inputs  are  0's  and  1 otherwise.  To  calculate  the  output  of  a NAND  gate  we  simply 
negate  the  inputs  and  branch  to  the  OR  function  routine  thus  savings  in  code  can  be 
achieved  (step  12).  Next,  we  compare  the  old  sink  output  with  the  new  sink  output  and 
employ  the  selective  trace  technique.  If  the  two  outputs  are  equal  then  no  further  action 
is  taken  and  the  next  sink  in  the  stack  is  processed.  If  the  two  outputs  differ  then  the 
sink  must  be  scheduled  in  time  in  order  to  propagate  its  output.  The  scheduling  process 
is  done  as  follows.  First,  we  find  the  time  queue  in  which  the  sink  must  be  placed  by 
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adding  the  clock  to  the  sink  propagation  delay.  Addition  is  done  modulo  (maximum  de- 
lay + 1).  The  sink  is  inserted  at  the  end  of  the  appropriate  queue  (end  of  list  pointer) 
using  storage  from  the  available  list  (step  13).  If  the  stack  is  empty  then  we  finished 
scheduling  the  sinks  and  we  are  ready  to  process  the  next  queue  (steps  14,  15).  Note 
that  if  any  sinks  had  a propagation  delay  then  the  clock  is  not  incremented.  We  final- 
ly note  that  the  logic  circuit  reaches  steady  state  when  all  the  time-queues  become 
empty . 

The  basic  steps  of  the  algorithm  were  presented;  however,  the  simulator  performs 
other  functions  such  as  keeping  track  of  the  size  of  the  queues,  updating  of  the  hard- 
ware input  and  output  parts,  and  keeping  in  synchronization  with  other  model  units. 

Also  the  simulator  must  initialize  the  dynamic  tables  before  any  modeling  can  be  per- 
formed. All  these  operations  can  be  performed  in  a straightforward  manner  and  are 
not  considered  in  the  present  report. 

D,  Modeling  System  Architecture 

In  this  section  we  will  consider  the  hardware  requirements  for  the  modeling  sys- 
tem. The  basic  modules,  the  model  unit,  and  the  control  units  will  be  described  and 
the  operation  procedures  presented. 

Each  subunit  in  the  system  will  be  modeled  by  a microcomputer  contained  on  one 
card.  Total  system  simulation  is  achieved  by  wiring  together  the  microprocessor 
cards.  Figure  3 shows  a typical  modeling  system  consisting  of  an  array  of  microproces- 
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Fig.  3.  Modeling  system. 

sors  operating  under  centralized  control.  The  master  controller  serves  the  following 
functions.  It  acts  as  a front-end  for  the  user  who  interacts  with  the  system.  The  user 
feeds  in  the  description  of  the  system  to  be  simulated.  The  information  is  transferred 
by  the  master  controller  to  a read-only  memory  programmer.  The  read-only  memory 
modules  containing  the  static  tables  for  each  subunit  are  then  used  to  personalize  each 


412 


COMPUTERS  AND  COMPUTER- COMMUNICATION  NETWORKS 


:■ 

(S 


microprocessor  card.  The  master  controller  has  the  task  of  synchronizing  the  micro- 
processor cards.  This  is  achieved  through  control  lines  to  be  described  later  on  in 
this  section.  The  system  clock  which  controls  the  simulation  is  also  generated  by  the 
master  controller. 

The  model  unit  contains  the  microprocessor  (CPU),  the  read-only  memory  (ROM) 
modules  with  the  simulation  program  and  the  static  tables  describing  the  logic  network, 
the  random-access  memory  for  storage  of  the  dynamic  tables,  time  queues  and  pro- 
gram variables,  the  primary  input  and  output  ports,  and  the  control  ports.  The  archi- 
tecture of  the  model  unit  is  presented  in  Figure  4. 


Fig.  4.  Model  unit. 

The  control  unit  can  be  implemented  using  also  a microprocessor  and  contains 
the  interface  circuitry  to  the  terminal,  ROM  programmer,  and  the  synchronization 
hardware.  Various  configurations  are  possible  for  the  control  unit  and  in  this  section 
we  shall  concern  ourselves  only  with  the  synchronization  hardware. 

Using  the  nominal  delay  algorithm  we  note  that  simulation  is  divided  into  time 
units.  Each  time  unit  the  microprocessor  must  sample  the  inputs  then  process  the 
current  time  queues.  We  note  that  the  process  is  synchronous  in  that  all  the  cards 
must  sample  the  inputs  at  the  same  time.  Furthermore  we  know  that  a digital  system 
is  usually  controlled  by  a system  clock.  Each  clock  period  consists  of  several  time 
units  during  which  changes  in  the  logic  network  are  processed.  Before  a new  clock 
cycle  can  be  started  the  logic  network  must  reach  steady  state.  With  the  above  con- 
siderations in  mind  we  introduce  three  signal  lines  sufficient  to  control  the  microproces- 
sor array.  They  are:  synchronization  line  (SYNC),  the  acknowledge  line  (ACK),  and 
the  steady  state  line  (SS).  The  SYNC  line  is  generated  by  the  master  controller  and 
indicates  to  each  card  when  to  start  the  next  time  unit  simulation.  The  ACK  line  is 
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generated  by  each  microprocessor  and  indicates  that  the  simulation  of  the  current  time 
unit  is  complete.  The  SS  line  is  generated  by  each  microprocessor  and  indicates  that 
the  subunit  simulated  has  reached  steady  state.  The  system  clock  is  generated  by  the 
master  controller  and  feeds  into  the  primary  inputs  of  the  model  unit,  the  clock  is  not 
a control  line  for  the  model  unit.  The  synchronization  process  is  shown  in  Figure  5. 


Fig.  5.  Synchronization  process  . 

If  it  is  desired  to  connect  the  modeling  system  to  "real"  hardware  then  the  follow- 
ing limitation  should  be  considered.  The  "real"  hardware  should  be  capable  of  static 
operation;  there  should  not  be  any  limit  on  the  speed  of  operation  since  the  array  of 
microprocessors  has  a slow  response  in  "real-time.  " Furthermore  the  clock  of  the 
'real"  hardware  should  be  controlled  by  the  master  unit  to  achieve  proper  synchroniza- 
tion with  the  array  of  microprocessors. 

So  far  we  have  described  a basic  modeling  system  for  digital  systems.  The  sim- 
ulation algorithm  and  the  system  architecture  were  presented.  The  system  can  be 
enhanced  by  the  addition  of  other  features  which  can  be  implemented  in  a straightfor- 
ward manner.  For  example,  it  is  desirable  to  have  functions  to  help  in  the  debug 
process  of  the  logic  network.  Such  functions  could  be:  signal  trace  capability,  set/re- 
set of  gate  inputs,  fault  insertion.  The  new  features  could  be  added  easily  to  the  sim- 
ulator structure  presented  making  the  modeling  system  a valuable  tool  in  the  design 
process . 
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E.  Conclusion 

This  report  presents  a new  approach  towards  modeling  a large  digital  system. 

Due  to  new  advances  in  semiconductor  technology,  the  size  and  complexity  of  digital 
systems  have  been  increasing  at  a steady  rate.  The  logic  verification  phase  of  the 
design  process  has  become  more  complicated  due  to  a lack  of  adequate  tools.  It  was 
shown  that  the  two  commonly  used  methods  for  logic  verification,  software  simulation 
and  breadboarding,  cannot  handle  very  large  logic  networks  efficiently.  Both  methods 
had  advantages  desired  in  a modeling  system  which  led  to  the  idea  that  a combination 
of  hardware  and  software  simulation  might  be  better  suited  for  logic  verification. 

The  system  described  in  this  report  is  a low-cost  modeling  system  capable  of  fast 
response  and  high  interaction.  The  system  consists  of  an  array  of  microprocessor 
units  operating  under  centralized  control.  Each  unit  models  a logic  network  using  a 
software  simulation  algorithm  thus  preserving  the  advantages  of  software  simulation. 

The  units  are  interconnected  in  hardware  which  allows  mixed  modeling  techniques.  : 

The  use  of  multiple  processors  provides  increased  computational  power  for  the  system 
which  leads  to  fast  response  to  input  patterns  even  for  large  logic  networks.  Besides 
modeling  purposes  the  system  could  also  be  used  as  a testing  tool  since  a model  unit 
could  be  replaced  by  "real"  hardware. 

To  implement  the  system,  the  software  and  hardware  requirements  were  described. 

The  nominal  delay  algorithm  analyzed,  used  standard  simulation  techniques  such  as: 
function  simulation,  selective  trace  and  next- event  time  flow  mechanism.  The  algorithm 
and  simulation  data  base  were  presented  in  a form  suitable  for  implementation  on  a 
microprocessor.  The  basic  system  architecture  consisting  of  the  control  unit,  model 
unit,  and  synchronization  hardware  was  described. 

This  brief  report  is  not  meant  to  be  a complete  description  of  a modeling  system 
but  provides  a new  approach  towards  logic  verification  and  the  basic  concepts  for  im- 
plementation. Obviously  there  is  room  for  improvement  in  that  other  features  such  as 
debug  aids  should  be  added.  Additions  to  and  modifications  of  the  system  for  particular 
applications  should  be  straightforward  to  implement  on  the  basic  structure  presented. 
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ADAPTIVE  CHANNEL  CAPACITY  CONTROLLERS  FOR  COMMUNICATION  NETWORKS 
L.  Shaw  and  K.  Sohraby 

This  study  considers  the  performance  improvements  which  might  be  attained  by 
switching  the  channel  capacity  for  messages  leaving  a buffer  in  a store-and-forward 
node  of  a communication  network.  The  proposed  controller  bases  its  decision  on  the 
number  of  messages  currently  in  the  buffer.  Such  decisions  are  made  relatively 
frequently,  say  after  intervals  during  which  a small  number  of  messages  are  expected 
to  arrive  and  depart.  This  mode  of  operation  is  feasible  where  several  priority 
classes  of  messages  share  transmission  links  (e.g.  , voice  and  data)  and  the  relative 
share  for  each  class  can  be  quickly  changed. 

The  analysis  makes  use  of  transient  properties  of  M /M  / 1 queues,  approxima- 
tion of  Non-Poisson  queues  by  Poisson  queues,  Dynamic  Programming,  special 
search  techniques  for  minimizing  multimodal  objective  functions,  and  hierarchical 
optimization  for  decoupling  the  network  optimization  into  a sequence  of  single  queue 
optimizations. 

2 

This  project  was  introduced  in  Ref.  , and  is  more  completely  described  in  a 
Ph.  D.  thesis.  * 

A.  Single  Queue  Optimization 

A channel  capacity  p e { p j , . . . Pm  } = U is  to  be  chosen  for  the  time  inter- 

val 0 < t £T  in  order  to  minimize  the  following  expressions  for  mean  delay  per  mes- 
sage plus  channel  utilization  cost,  when  there  are  i messages  in  the  queue  at  t = 0 . 


T 

Ci=  / 


K-l 

Yj  j j3  pik  (p  . t) 

Lj=l 


dt  + a T p (p) 


II) 


The  ptj  (t)  are  transition  probabilities  for  a queue  with  Poisson  arrivals  (rate  X.  ), 
expotential  service  times  (rate  p ),  waiting  room  of  size  (K-l).  These  probabilities 
are  obtained  by  integration  of  the  birth  - death  differential  equations.  The  cost  of  lost 
messages  is  introduced  by  the  factor  ft  , the  channel  costs  are  p (p)  per  second,  and 
a represents  a weighting  factor  between  dollars  of  channel  cost  and  seconds  of  mes- 
sage delay. 

Since  U is  finite  there  is  clearly  an  optimal  policy  M which  assigns  some  p^ 
to  each  M.  so  as  to  minimize  CL  (i  = 0,  1 , 2.  . ( K-  1 )).  This  best  policy  can  be  found 
efficiently  by  Howard's  policy  improvement  method  of  Dynamic  Programming. 

The  output  process  of  such  a queue  does  not  have  a simple  structure.  However, 
it  can  be  well  approximated  in  each  T second  interval  by  a Poisson  process.  For 
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each  initial  state  i and  service  rate  p.  , ons  can  compute  the  mean  departure  rate 
over  the  T second  interval  as 


i7i  (H  ) = H 


10 


(2) 


where  piQ  (t)  is  the  transient  probability  for  an  M/M/1 /K  queue. 

The  effectiveness  of  this  poissonian  assumption  was  tested  by  simulation  of 
both  the  actual  and  approximating  processes  as  the  inputs  to  a second  queue.  The  av- 
erage message  delays  in  the  second  queue  served  as  the  relevant  measure  of  approx- 
imation quality.  The  resulting  degree  of  approximation  depended  on  the  initial  state 
of  the  second  queue,  numerical  values  of  basic  arrival  and  departure  rates,  etc.  An 
average  approximation  error  of  about  10%  justified  use  of  this  method. 

B.  Multilevel  Optimization  of  a Network  of  Adaptive  Capacity  Nodes 

The  criterion  function  for  a network  of  adaptive  capacity  nodes  is  the  sum  of 
delays  and  channel  utilization  costs  for  each  node.  For  a network  of  N nodes 

N 

C = E C (3) 

i=l 

Unfortunately,  the  capacity  assignment  policies  for  the  various  nodes  cannot  be  op- 
timized separately  because  policy  changes  at  one  node  will  change  the  arrival  rates 
at  the  following  nodes.  However,  the  additive  form  for  the  network  cost  suggests 
consideration  of  hierarchical  optimization  techniques  which  have  been  effective  in 

. 3 

similar  optimization  problems. 

The  simple  three  node  loop  network  in  Fig.  1 displays  the  interaction  of  individ- 
ual node  optimizations.  Reference  1 explains  how  the  hierarchical  optimization  de- 
scribed here  for  this  simple  network  can  be  generalized  in  a straightforward  manner 
to  any  network 

The  i**1  node  is  characterized  by 

K.  = capacity  of  the  it^1  buffer  (messages) 
i th 

M.  = control  policy  vector  with  j component 

M^j  = capacity  when  j messages  are  present  at  the  beginning  of  the 
control  interval. 

X.  = input  vector  with  components 

H ..  = x.  = j possible  arrival  rate  (Poisson)  (j  = 1, 2 . . . R.) 

th 

it..  = x.  _.  . = probability  of  occurrence  of  j possible  arrival  rate 

1 J L f cj  “ 1 

Z.  = T(  Xj  » ML  ) = vector  of  output  rates  and  their  probabilities. 
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Fig.  1.  A typical  3-node  network  with  its  subsystems. 

The  definition  of  M.  implies  that  the  controller  at  each  node  "sees"  only  his  own 

1 j 

state.  Other  possibilities  which  have  been  considered  include  i)  a controller  whicn 
sees  the  states  at  its  node  and  at  all  nodes  which  feed  it;  and  ii)  a centralized  con- 
troller which  uses  the  states  of  the  queues  at  all  nodes  in  the  network. 

We  want  to  choose  M^  (i  = 1,  2 . . . N)  to  minimize  C.  The  hierarchical  approach 
defines  a sequence  of  subgoals  for  each  node.  The  second-level  computation  revises 
the  subgoals,  after  examining  the  results  of  the  N previous  subgoal  minimizations, 
in  a manner  which  aims  the  procedure  toward  the  global  network  optimization. 

The  first-level  imagines  each  node  to  be  isolated  and  free  to  choose  its  channel 
policy  Mj  and  its  arrival  process.  However,  connection  with  the  real  problem  is 
maintained  by  augmenting  the  local  cost  to 


C. 

l 


T T 

+ P,.  . Z.  - P.1  X. 

— i+  1 — i —i  — i 


<4) 


so  that  the  modified  total  network  cost  becomes 
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C = C + l P-^  (5) 

i = l 

The  P^  vectors  are  Lagrange  multipliers  or  penalty  factors  which  are  chosen  by  the 
second-level  computation  to  penalize  discrepancies  from  the  contraint  2L  j = JC  , 
corresponding  to  the  network  connection.  (Note  that  the  loop  constraint  is  incorpo- 
rated by  the  notational  definitions  P^+j  = Pj  and  Zq  = Z^  . ) Clearly,  the  { M^  } 
corresponding  to  the  global  optimum  (Min  C)  will  also  make  C = C . 

The  hierarchical  optimization  procedure  is  summarized  as  follows. 

1.  Given  an  initial  set  { FV  } , find 

C . f 1 1 = Min  C . corresponding  to  X.  f 1 1 , M .f  11 
1 1 J X.  , M.1  — J — il  i 

— i — i 

and  resulting  in  Z.  [ 1]  . 

2.  Modify  the  P^  , In  order  to  increase  the  weight  on  bit  input-output 
discrepancies,  we  can  use 

£i[2]  "fill]  + A(ZU1[1])  (6) 

As  described  in  Ref.  3 other  second-level  penalty  adjustments  may  be 
more  efficient. 

3.  Return  to  step  #1  and  compute  Xj[  2] , MJ2],  ZJ2]  using  { PJ  2]  },  etc. 

This  kind  of  procedure  can  be  shown  to  converge  if  the  original  optimization 

3 

problem  satisfies  suitable  conditions  such  as  convexity  of  the  cost  function.  Verifi- 
cation of  those  conditions  can  be  as  complicated  as  solving  the  original  optimization 
problem,  but  experiments  with  this  approach  on  this  network  problem  have  converged. 
Even  if  complete  convergence  does  not  occur,  these  steps  should  produce  reasonable 
policies  whose  effectiveness  can  be  checked  directly. 

The  network  problem  requires  imposition  of  physical  constraints  on  the  vari- 
ables in  step  # 1 of  the  optimization. 

1.  M„  e (^ii*  ^ i2  ‘ ’ ^i  k. ^ = set  of  possible  channel  capacities. 


2.  0 < x.  . < (q. .) 

— i,  2j-  1 ij  max 


3- 


1 
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Moreover,  the  mean  delay  part  of  the  cost  in  Eq,  (1)  must  be  modified  as  follows  to 
account  for  the  random  selection  of  Poisson  arrival  processes. 

Using  the  notation  p^m(p.,t)  for  the  transition  probability  from  state  i to  state 
j in  t seconds  with  arrival  rate  rj  and  departure  rate  p , the  generalized  form 
of  mean  delay  plus  channel  cost  at  node  i becomes 
K.  T K.-l 

S<H)=  i \m  fZi  j 0 dt  + « T C (p)  (8) 

m = l oj  = l J 

Similarly,  the  effective  departure  rate  in  Eq.  (2)  must  be  generalized  to 


K.-l  T m 

q.(p)=p  + 2 *lm  /pj^.tjdtl 

i_  m =1  o ; 


Minimization  of  C.  in  Eq.  (4)  using  Eq.  (8)  is  carried  out  in  two  steps.  With 
X . fixed,  the  minimization  with  respect  to  channel  policy  M.  is  carried  out  by  the 
Dynamic  Programming  approach  mentioned  earlier.  This  requires,  for  each  candi- 
date K1  . , integration  of  the  transient  probabilities  in  Eq.  (8)  for  each  arrival  rate 
dirn  . as  well  as  in  Equation  (9).  In  the  long  run,  the  cost  at  node  j per  T seconds 
will  be  a number  C^  , independent  of  the  initial  state  i . This  is  the  quantity  which 
is  augmented  by  penalty  costs  in  Equation  (4).  This  averaging  approach  contains  the 
further  implicit  assumption  (clearly  an  approximation)  that  the  arrival  rates  in  sep- 
arate T- second  intervals  are  mutually  independent. 


The  other  half  of  the  first  level  minimization  requires  a search  over  all  admis- 
sible input  vectors  X^.  The  procedure  of  Chichinadze4  was  used  to  seek  the  global 
minimum  of  a multivariable  objective  function  which  has  several  local  minima.  This 
method  is  most  easily  explained  via  minimization  of  an  S(x)  where  x is  a scalar.  A 
function  <\>  (%)  is  defined  as  the  total  width  of  x-intervals  in  which  S(x)  < £ . A curve 
passing  through  + (£)  for  several  values  % can  be  extrapolated  to  find 

£ = min  S (x) 

m x 

where  (4m)  “0.  With*/’  (£m>  as  the  center  of  gravity  of  the  x-intervals  where 
S(x)  < |,  (£)  can  be  extrapolated  to^f^)  = xv  to  estimate  the  optimizing  value  of  x. 

Both  4M£.)  and ^(4)  are  estimated  by  computing  S(x)  for  many  values  of  x. 

This  first-level  optimiation  is  very  time  consuming,  since  it  incorporates  inte- 
gration of  the  transient  queue  behavior  and  searching  for  the  minimum  of  a multimo- 
dal function  of  a large  number  of  variables.  In  the  three  node  loop  network  example  be- 
low, where  each  node  stores  a maximum  of  4 messages,  the  entire  hierarchical  opti- 
mization (inolving  five  iterations  of  the  {P. } took  about  28  minutes  on  an  IBM  360/65. 
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Example 


For  an  example  of  the  design  of  the  above  three  node  network,  the  storage  ca- 
pacities of  buffers  are  assumed  to  be  Kj  = = Kj  = 4 messages  ( 5 states  for  each  buf- 

fer), and  at  each  node,  an  arriving  message  has  a probability  of  1/2  of  departing 
from  the  network  and  of  1 / 2 of  queuing  for  transmission  to  the  assumed  next  node. 

The  arrivals  from  outside  the  network  have  rates  Xj=  1,  \2  =2,  = 1.  5 messages  per 

minute,  and  the  control  interval  is  taken  to  be  T = 2 minutes.  The  delay  control  para- 
meter for  each  buffer  is  assumed  o = 0.  4 and  the  blocking  parameter  /3  = 10.  A linear 
channel  cost  of  the  form  C(p^)  = is  used  for  all  the  buffers. 

A value  of  A = 0.  1 was  used  in  the  penalty  updating  of  Eq.  (6),  and  5 iterations 
of  that  updating  were  required  for  effective  convergence.  The  final  iteration  results 
of  channel  capacity  policies  as  well  as  extrapolated  and  actual  performance  costs  are 
listed  in  the  following  table: 


TABLE  I.  Adaptive  channel  results. 


Node 

Channel  Capacity  Policy 
(Kb/sec)  States  (0,  1, 

2,  3,  4) 

Actual 

performance 

Cost 

Extrapolated 
performance 
Cost:  | 

1 

2.  4,  2.  4,  3.  6,  6,  7.  2 

10.  18 

9.  65 

2 

3.  6,  4.  8,  6,  7.  2,  7.  8 

10.  23 

9.  36 

3 

4.  8,  7.  2,  9.  6,  9.  6,  10.  8 

15.  19 

14.  71 

I 

35.60  Total 

| 

! 

As  comparison,  the  design  of  this  three  node  network  with  optimal  fixed  channel  ca- 
pacities and  finite  storage  buffers  (Kj  = K2  =K3  = 4 messages)  was  considered.  This 
problem  was  solved  through  multilevel  optimization  techniques  of  adaptive  networks. 
The  following  table  presents  fixed  channel  capacities  as  well  as  performance  costs  of 
each  node; 

TABLE  II.  Fixed  channel  results. 


Node 

Fixed  Channel  (Kb/sec) 

Performance  Cost 

1 

6 

12.61 

2 

10.  8 

17.  96 

3 

10.  8 

16.  72 

47. 29  Total  | 
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This  example  reveals  about  a 30%  total  performance  cost  improvement  by  the  adap- 
tive network  model. 

Joint  Services  Technical  Advisory  Committee  L.  Shaw.  K Sohrabv 
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SOFTWARE  MODELING  STUDIES 
M.L.  Shooman  and  H.  Ruston 

The  material  presented  below  describes,  in  summary  fashion,  the  research  re- 
sults obtained  in  the  modeling  of  computer  software. 

A.  Major  Results 

The  major  thrust  of  the  work  is  divisible  into  three  areas:  software  reliability 
and  availability  models,  test  models,  and  complexity  models. 

Building  upon  previous  modeling  work  by  Shooman  in  the  reliability  area,  new 
macro  and  micro  reliability  models  have  been  developed,  namely: 

(1)  a Markov  availability  model; 

(2)  estimator  formulas  and  tagging  methodology  for  estimating  the  initial 
and  subsequent  number  of  errors; 

(3)  the  macro  model  was  made  more  realistic  through  the  incorporation 
of  error  generation  effects; 

(4)  a micro  model  based  upon  path  traversal  of  the  software. 

The  approach  in  the  area  of  testing  has  been  to  classify  and  model  as  well  as  to  provide 
test  techniques.  The  research  has  resulted  in: 

(1)  a new  method  of  test  data  selection; 

(2)  a classification  of  types  of  tests; 

(3)  the  construction  of  automatic  test  drivers; 

(4)  a statistical  model  of  testing; 

(5)  a model  of  an  exhaustive  test. 

Complexity  measures  have  been  sought  which  relate  problem  complexity  to  pro- 
gramming effort  and  reliability.  The  major  accomplishments  were: 

(1)  modest  advances  in  the  application  of  the  classical  theory  of  recursive 
function  to  software  problems  (the  method  has  practical  drawbacks); 

(2)  development  of  software  structural  measures  such  as  accessibility  and 
testability; 

(3)  experimental  verification  and  theoretical  proofs  that  Zipf’s  law  applies 
to  computer  languages; 

(4)  development  of  estimator  formulas  for  computing  the  Zipf's  law  com- 
plexity measure. 

B.  Applications  to  Software  Engineering 
1 . Design  Phase 

Design  is  inherently  an  iterative  process,  involving  many  separate  steps,  repeat- 
ed at  different  stages  of  the  project.  During  the  initial  stage  the  proposed  design  and 
its  alternatives  are  evaluated  for  feasibility  and  performance.  One  of  the  first  steps  is 
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to  gauge  the  overall  scope  and  feasibility  of  the  job  by  assessing  its  complexity.  The 
common  approach  is  to  equate  it  with  similar  past  systems,  on  which  data  exists,  and 
draw  comparisons.  Such  a technique  is  imprecise  and  of  diminished  usefulness  in  a 
military  environment,  where  most  new  tasks  eclipse  their  predecessors  in  complexity. 

Some  of  the  complexity  measures  developed  (cf.  Section  E.4)  are  related  to  funda- 
mental properties  of  the  problem  and  should  allow  the  extrapolation  of  past  results  into 
the  future.  To  utilize  the  complexity  model  during  early  design,  we  tabulate  the  input 
and  output  quantities,  and  estimate  what  proportion  of  the  available  operators  in  the 
chosen  language  will  be  used.  These  values  are  substituted  into  a formula  which  pre- 
dicts the  operator-operand  (token)  length.  One  then  relies  on  historical  data  for  the 
ratios  of  errors/token  and  man-hours/token.  Although  these  ratios  depend  on  historical 
data,  the  scaling  procedure  based  upon  token  complexity  is  objective,  repeatable,  and 
has  been  shown  (in  our  small  number  of  retrospective  estimates)  to  be  reasonably  ac- 
curate. 

The  reliability  model  discussed  in  Section  C.3  can  be  used  to  roughly  predict  the 
software  failure  rate  and  mean-time  between  software  errors  at  the  design  stage.  One 
of  the  inputs  to  this  model  is  the  instruction  length  of  the  program.  This  can  be  estima- 
ted from  the  token  length  and  hsitorical  data  on  the  number  of  instructions/token  in  the 
chosen  language.  An  additional  parameter  needed  in  the  model  is  the  error  discovery 
rate  constant.  This  constant  relates  the  number  of  occurrences  of  external  operational 
errors  to  the  number  of  residual  internal  errors.  (Note:  an  internal  error  is  a poten- 
tial one  which  only  becomes  an  actual  external  error  when  the  particular  data  which 
excites  it  is  processed.)  Much  of  the  research  described  in  this  report  involves  tech- 
niques for  measuring  these  model  parameters  and  testing  the  ease  and  accuracy  of  ap- 
plication of  the  above- mentioned  formulas  (cf.  C.2,  D.  2,  F.  3,  F.5,  F.ll). 

2.  Testing  and  Debugging  Phase 

The  contributions  to  this  phase  contain  several  techniques  for:  selecting  test 
data,  calculating  the  number  of  tests,  and  automatic  testing.  In  addition,  contributions 
were  made  to  classifying  software  by  the  ease  of  testing  and  classifying  tests  by  their 
degree  of  completeness. 

The  technique  for  selecting  test  data  involves  the  mapping  of  constraints  on  the 
output  into  the  equivalent  set  of  constraints  on  the  input  test  data.  A description  of  this 
technique  is  given  in  Section  D.  3. 
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The  knowledge  of  the  number  of  tests  is  essential  for  bounding  the  test  costs  and 
developing  a test  plan.  The  method  described  in  Section  D.4  consists  of  a sequence  of 
stages.  First,  the  control  structure  of  the  software  is  modeled  by  a graph.  Second, 
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matrices  representing  the  graph  structure  are  constructed.  Lastly,  matrix  manipula- 
tions are  performed  to  obtain  the  desired  number  of  tests. 

The  automatic  testing  technique  developed  in  this  contract  yields  a test  driver. 
This  driver  analyzes  the  source  code  and  modifies  it.  The  modification  allows  the  gen- 
eration of  test  data  which  exercises  the  software  in  a specific  manner.  The  modifica- 
tion is  such  that  each  IF  statement  is  associated  with  either  a bit  string  variable  of  0 or 
1,  representing  the  two  outcomes.  Similarly,  the  passage  of  a DO  WHILE  loop  is  con 
trolled.  Lastly,  various  program  modules  are  exercised  for  the  initial  values.  The 
details  of  this  method  are  given  in  Section  D.5. 

The  classification  of  software  is  based  upon  accessibility  of  certain  modules,  ease 
of  testing  such  modules,  and  the  use  of  resources  in  such  tests.  The  details  of  this 
classification  are  given  in  Section  E.3. 

3.  Operational  Phase 

Many  of  the  models  described  previously  can  also  be  applied  to  the  operational 
phase.  As  an  example,  the  release  of  large  software  systems  to  the  field  is  accompa- 
nied by  a flurry  of  software  errors.  The  field  debugging  process  can  be  controlled  by 
the  reliability  models  described  in  Section  C.  3. 

Once  steady- state  prevails  in  the  field,  the  most  significant  parameter  of  soft- 
ware performance  is  its  availability.  The  Markov  availability  models  described  in 
Section  C.  1 may  be  used  to  predict  the  steady-state  availability  of  the  software. 

The  immediate  predecessor  of  the  operational  phase  is  the  acceptance  test.  This 
test  delineates  the  transition  between  development  testing  and  operational  deployment. 
Section  F.2  describes  one  type  of  such  test.  The  philosophy  of  the  acceptance  test 
divides  the  acceptance  into  two  parts.  The  first  part  involves  testing  of  each  feature 
within  the  nominal  range.  The  second  part  involves  testing  of  stressful  conditions  as 
well  as  boundaries  of  the  range.  The  software  is  accepted  only  if  a prestated  high  per- 
centage of  the  tests  are  successful. 

Section  F.2  delineates  the  suggested  procedures  to  be  followed  if  the  software  is 
not  accepted . 

C.  Software  Error.  Reliability,  and  Availability  Models 
1 . Introduction 

We  shall  present  here  brief  summaries  of  work  documented  in  technical  reports 
and  papers  relating  to  models  of  software  errors  and  their  effect  on  operational  relia- 
bility and  availability.  These  models  are  needed  for  a number  of  technical  and  manage- 
ment functions,  namely: 
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(a)  To  estimate  the  cost  and  development  time  related  to  the  excision  of  the 
errors  initially  residing  in  the  program. 

(b)  To  provide  an  analysis  technique  for  one  of  the  three  decisions,  that  is, 

(I)  to  continue  more  testing,  (2)  to  terminate  testing  and  to  accept  the 
software,  and  (3)  to  abort  a hopeless  effort. 

(c)  To  provide  a technique  for  cost- reliability  trade-off  among  competitive 
designs.  Such  trade-off  calculations  are  needed,  for  example,  to  choose 
between  a time  consuming  technique  which  produces  software  with  few 
errors  and  a fast  technique  leading  to  many  errors. 

(d)  To  provide  definitions  and  techniques  for  operational  data  collection  and 
measurement.  The  collected  data  is  needed  for  the  determination  of 
model  parameters  and  the  measurement  of  operational  system  reliability. 

(e)  To  provide  quantitative  measures  of  goodness  which  enable  researchers 

to  assess  the  promise  of  a new  technique.  This  can  be  done  by  contrasting 
the  error  content  and  reliability  of  software  constructed  with  and  without 
such  a technique. 

The  research  supported  under  this  contract  has  advanced  the  field  by  providing: 

(a)  a Markov  model  of  availability, 

(b)  an  estimate  of  initial  error  content  of  software  based  upon  a tagging  pro- 
cedure, 

(c)  a software  reliability  model  incorporating  errors  generated  during  the 
debugging  process, 

(d)  a micro  reliability  model  based  upon  path  traversal  of  the  software. 

2 . Reliability  Models 

The  initial  thrust  of  modeling  focussed  on:  (1)  the  number  of  software  errors, 

(2)  the  rate  of  their  removal  during  the  testing  phase,  and  (3)  the  operational  failure 
rate  resulting  from  residual  errors.  This  work  leads  to  three  phases  of  models,  to  be 
described  here. 

Macro  Models 


The  model  developed  in  the  first  phase  portrayed  just  the  gross  features  of  inter- 
est. It  provided  expressions  for  the  number  of  remaining  errors  as  the  debugging 
process  unfolded.  Certain  reasonable  assumptions  were  made  to  relate  the  number  of 
residual  errors  to  the  operational  system  reliability.  One  of  the  assumptions  postulated 
the  constancy  of  the  sum  of  the  removed  errors  and  the  remaining  errors. 

The  model  of  the  first  phase  was  named  the  macro  model,  because  it  only  con- 
sidered the  gross  or  exterior  behavior  of  the  removal  process.  Such  a model  is  analo- 
gous to  modeling  the  exterior  input-output  relationships  of  the  system  without  regard 
to  the  interior  structure  of  the  system.  The  macro  model  is  described  in  References 
13,  14  and  15. 


as 
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Macro  Model  with  Error  Generation 

The  macro  model  was  refined  in  the  second  phase.  The  refinement  consisted  of 
incorporating  the  errors  generated  during  the  debugging.  The  generated  errors  were 
modeled  in  several  ways.  Their  rate  of  removal  was  modeled  as  a function  of  the 
project's  manpower,  giving  rise  to  a related  investigation  of  manpower  deployment 
strategies.  The  complete  development  of  the  error  generation  model  and  the  effect  of 
deployment  strategies  is  described  in  Reference  7. 

Micro  Model 

The  completion  of  the  second  phase,  just  described  above,  indicated  that  further 
improvement  in  the  fidelity  of  the  model  will  necessitate  the  incorporation  of  factors 
related  to  the  structure  of  the  software.  The  model  which  was  investigated  is  based 
upon  the  program  flowcharts  (or  equivalent  representations,  e.g.,  pseudo- code),  its 
paths,  and  the  execution  and  the  error  frequencies  along  each  path.  Such  a model  is 
based  on  the  assumption  that  the  software  system  can  be  decomposed  into  a modest  num- 
ber of  mostly  independent  paths  (structured  programs  generally  satisfy  this  assumption). 
Any  gross  failure  of  the  decomposibility  or  independence  features  invalidate  such  a 
model.  The  micro  model  is  described  in  Reference  16. 

3-  Techniques  for  Estimation  of  Error  Counts 

A method  has  been  developed  for  the  estimation  of  errors  residing  in  software  at 
any  stage  of  its  construction.  This  method  is  based  on  either  seeding  or  tagging  of 
errors  in  the  program  under  construction.  The  initial  idea  of  seeding  comes  from  sta- 
tistical techniques  for  estimating  the  size  of  wild  life  population,  and  was  first  adapted 
by  H.  Mills.  Mills  experimented  with  the  addition  of  seeded  bugs  to  those  naturally 
residing  in  the  program.  After  partial  debugging  by  another  programmer  (i.e.,  one 
who  does  not  know  of  the  seeded  bugs),  the  bugs  found  are  separated  into  the  seeded  and 
the  natural  categories.  By  using  the  total  number  of  seeded  bugs,  the  number  of  seeded 
bugs  found,  and  the  number  of  natural  bugs  found,  Mills  obtained  estimates  for  the 
initial  number  of  natural  bugs.  A major  problem  in  applying  this  technique  is  the  dif- 
ficulty in  "manufacturing"  realistic  bugs  for  seeding. 

To  avoid  this  difficulty  it  has  been  suggested,18  that  instead  of  seeding  program 
errors,  a tagging  technique  be  used.  This  technique  is  implemented  by  having  the  pro- 
gram debugged  independently  by  two  or  more  testers.  A different  person,  acting  as  the 
analyst,  analyzes  the  lists  of  bugs  found  and  identifies  common  bugs  found  by  two  or 
more  programmers.  The  number  of  common  bugs  found  by  multiple  programmers 
serves  the  same  purpose  as  the  seeded  bugs  in  the  former  techniques.  The  disadvantage 
of  the  method  is  the  need  for  additional  personnel  (i.e.,  the  independent  debuggers). 
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However,  this  cost  is  largely  offset  by  the  extra  bugs  that  the  additional  personnel  finds. 

The  tagging  method  has  a number  of  advantages  over  the  seeding  one: 

(a)  The  difficulty  of  manufacturing  realistic  seeded  bugs  is  eliminated. 

(b)  None  of  the  testing  and  debugging  funds  are  used  for  the  finding  of 
known  seeded  bugs. 

Because  of  the  promise  of  the  method,  a major  effort  was  undertaken  resulting  in 

g 

a comprehensive  report.  This  report  derives  several  estimator  formulas  for  mean 
and  variance  for  different  sample  sizes.  The  report  also  describes  how  the  method  is 
to  be  used  for  the  case  of  two,  and  for  the  case  of  several,  debuggers.  A small  scale 
experiment  has  been  performed,  and  the  data  is  presently  being  analyzed. 

The  method  is  especially  useful  in  estimating  the  initial  number  of  bugs,  to  pro- 
vide measures  (1)  of  the  cost  of  testing  and  (2)  of  a parameter  in  the  reliability  model. 

It  is  also  useful  for  obtaining  information  for  management  decisions  at  certain  key 
points  of  software  development. 

4.  Markov  Availability  Models 

Models  have  been  developed^  for  the  determination  of  availability  of  an  operation- 
al (i.e.,  field  deployed)  computer  software.  For  many  systems  the  reliability  is  not 
the  crucial  measure,  but  rather  the  ratio  of  up-time  to  the  total  operating  time,  that  is, 
the  system  availability. 

The  models  developed  are  based  upon  Markov  processes.  The  initial  upstate 
assumes  the  system  to  be  running  with  undiscovered  bugs.  When  the  first  software  er- 
ror is  discovered,  the  system  enters  the  second  up- state  which  represents  n- 1 bugs. 

The  model  thus  consists  of  a sequence  of  up  and  down  states.  The  transition  probabil- 
ities between  up  and  down  states  are  essentially  the  respective  software  rates.  The 
transition  probabilities  between  up  and  down  states  are  the  probabilities  of  repair  taking 
place  in  a given  time  interval. 

The  model  leads  to  difference  and  differential  equations  for  the  state  probability 
(e.g.,  the  probability  that  the  system  is  in  a particular  up  or  down  state).  The  reports 
show  solutions  for  several  different  cases  involving  various  assumptions  on  the  failure 
and  repair  rates. 

D.  Test  Models  and  Techniques 

1 . Introduction 


We  shall  present  here  brief  summaries  of  work  documented  in  technical  reports 
and  relating  to  test  models  and  techniques.  These  models  are  needed  for  a number  of 
technical  and  management  functions,  namely: 
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(a)  To  obtain  parameter  values  for  certain  models  described  in  Section  4. 

(b)  To  classify  and  model  the  testing  process. 

(c)  To  suggest  new  means  and  strategies  of  testing. 

(d)  To  estimate  the  costs  of  software  testing. 

(e)  To  manage  and  measure  progress  during  the  test  phase  of  software 
development. 

The  research  supported  has  advanced  the  field  by  providing: 

(a)  A New  method  of  test  data  selection,  _ __ 

(b)  A classification  of  types  of  tests, 

(c)  The  construction  of  automatic  test  drivers, 

(d)  A statistical  model  of  testing, 

(e)  A model  of  an  exhaustive  test. 

2.  Small  Scale  Testing 

In  order  to  explore  the  verification  of  theoretical  models  (e.g.,  the  tagging  esti- 
mates), and  to  estimate  parameters  needed  for  other  models,  four  programs  have  been 
written  by  student  programmers  and  careful  records  were  kept  on  their  debugging  ex- 
perience. The  initial  effort  and  the  four  programs  are  described  in  Ref.  5 (pp.  22-32), 
along  with  the  reporting  scheme. 

The  four  programs  were  constructed  in  increasing  order  of  complexity:  (1)  salary 
payroll  adjustment,  (2)  roots  of  a cubic  equation,  (3)  library  search,  and  (4)  ballot 
counting  procedure.  Each  debugger  recorded:  (1)  the  number  of  bugs  and  their  types 
discovered  during  each  test  run,  (2)  the  time  required  for  analysis  of  each  error  and 
the  computer  time  of  the  run  needed  to  correct  the  error,  and  (3)  the  chronology  of  re- 
moval of  inherent  bugs  and  bugs  generated  during  the  testing  phase. 

As  the  data  reduction  continues,  we  intend  to  utilize  the  new  measure  of  complex- 
ity (i.e.,  Zipf  Law,  see  Ref.  9)  to  explore  the  relationship  between  complexity  and  the 
initial  number  of  program  bugs.  We  also  intend  to  correlate  the  Zipf's  Law  measure 
of  complexity  with  intuitive  subjective  estimates  of  relative  complexity  by  programmers 
(e.g.,  concensus  estimation,  see  Reference  19). 

3 . Data  Selection  Studies 

The  crucial  problem  in  testing  is  the  selection  and  generation  of  an  appropriate 
set  of  test  data.  This  difficult  problem  has  been  studied  from  several  viewpoints.  One 
such  view  will  be  described  here;  several  other  viewpoints  appear  in  the  sections  to 
follow. 

The  method  used  in  this  study  uses  constraints  imposed  on  the  variables.  Tracing 
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of  all  constraints  resulted  in  the  determination  of  bounds  on  the  variables,  and  thus  con- 
struction of  test  input  data.  The  method  requires  the  construction  of  dependency  tables 
and  has  been  described  in  Ref.  1 (pp.  10-12)  and  in  Reference  2 (pp.  33-50).  A more 
complete  treatment  is  given  in  Reference  20.  At  present  the  method  is  very  tedious  be- 
cause of  a laborious  hand- tracing  through  the  dependency  tables.  Future  efforts  will 
attempt  to  mechanize  the  effort  (e.g.  , by  perhaps  writing  a program  replacing  the 
hand-tracing)  and  to  apply  it  to  modest  size  programs.  Even  though  the  present  method 
is  traceable  to  programs  of  any  size,  the  tediousness  of  the  hand-tracing  makes  it  im- 
practical for  other  than  small  programs. 

4.  Determination  of  the  Number  of  Tests 

One  of  the  pertinent  questions  of  interest  is  the  number  of  tests  required  of  soft- 
ware, so  as  to  verify  such  software  to  a desired  degree  of  completeness.  A complete 
verification  requires  perhaps  an  exhaustive  test,  that  is,  a test  for  all  possible  sets  of 
inputs.  Obviously,  such  a test  is  almost  always  impractically  large.  Consequently, 

practical  considerations  demand  lesser  levels  of  testing.  Hierarchies  of  such  tests 

% 

have  been  categorized  (Ref.  21,  Chapter  4)  and  assigned  level  numbers  from  0 to  4. 
Type  0 test  exercises  each  program  statement.  The  highest  level  is  an  exhaustive  test. 
The  in-between  levels  represent  tests  more  complete  than  the  0 type  but  not  exhaustive 
(e.g.,  level  2 is  execution  of  all  paths  in  the  program).  Also  intermediate  levels  can 
be  introduced,  such  as  type  1.5  for  example,  which  identify  all  the  levels  of  the  one 
level  and  some  of  the  second  level. 

We  have  investigated  analytical  methods  for  computing  the  number  for  type  2 
tests.  This  work  was  reported  in  Ref.  2 (pp.  3-11),  Ref.  3 (pp.  40-49),  and  Reference 
4 (pp.  10,  14-16).  A more  detailed  discussion  is  given  in  a technical  report  presently 
in  preparation.  The  method  starts  with  a graph  of  the  control  flow  of  the  program. 
Theorems  from  graph  theory  have  been  applied  to  determine  the  number  of  type  2 tests. 
This  entails  visual  search  through  graphs,  which  is  very  cumbersome  for  large  prob- 
lems. The  alternate  approach  is  to  represent  the  control  graph  by  a connectivity  ma- 
trix. Such  a matrix  can  then  be  manipulated  via  matrix  transformations  to  yield  the 
desired  number.  The  matrix  method  is  amenable  to  computer  processing. 

5 . Automatic  Test  Drivers 

Another  approach  to  the  generation  of  test  inputs  evolved  in  the  construction  of  a 
so-called  test  driver.  Such  a driver  is  a generalized  program.  The  input  to  the  driver 


Two  different  scales  for  levels  have  been  introduced.  The  earliest  scale  went 
from  0 to  3.  The  newer  scale  extends  from  0 to  4. 
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is  the  program  to  be  tested.  The  driver  generates  signals  necessary  for  forcing  the 
traversal  of  each  program  path. 

The  initial  driver  was  implemented  in  LISP  and  is  described  in  Reference  5 (pp. 
14-17).  Further  work  is  reported  in  References  12  and  22.  These  references  describe 
the  progress  achieved  in  the  development  of  the  driver.  In  particular,  the  present  im- 
plementation is  in  PL/l,  which  allows  driver  testing  of  wider  classes  of  programs. 

The  initial  version,  for  example,  was  restricted  to  the  loopless  program;  the  present 
version  no  longer  suffers  this  constraint. 

The  essence  of  the  algorithm  is  to  associate  a binary  digit  with  the  predicate 
(i.e.,  condition)  of  each  decider  (implemented  in  PL/l  by  the  IF  statement).  The  state 
of  all  the  deciders  can  then  be  represented  by  a binary  number  made  up  of  an  ordered 
concatenation  of  the  individual  digits,  with  each  digit  representing  a single  decider. 

The  algorithm  generates  the  entire  valid  set  of  such  binary  numbers,  and  derives  the 
appropriate  driving  signals  from  them. 

Similar  techniques  are  used  to  reduce  the  testing  of  a DO  loop  to  just  the  first  and 
the  last  passes.  Such  a reduction  greatly  reduces  the  run  time  of  a complete  test.  For 
the  details,  the  reader  is  again  referred  to  the  cited  references. 

It  should  be  observed  that  the  assumptions  on  the  control  structure  being  composed 
of  only  simple  sequences,  repetitions,  and  selections,  are  satisfied  by  a structured 
program.  With  the  present  emphasis  on  usage  of  structured  programming  techniques, 
these  assumptions  are  not  very  limiting. 

6 . Statistical  Test  Models 

A statistical  test  model  has  been  developed  which  relates  different  program  er- 
rors to  the  input  data  set  (or  sets)  which  excite  and  thus  display  a particular  error. 

The  model  also  gives  the  probability  that  these  errors  will  cause  the  program  to  fail. 
This  work  was  reported  in  References  3 (pp.  19-30)  and  4 (Section  3.7).  A more  de- 
tailed description  of  the  model  appears  in  Reference  23. 

The  model  assumes  that  there  are  N total  possible  test  inputs  (i.e.,  input  data 
sets).  It  is  further  assumed  that  each  input  is  equally  likely  to  occur  either  as  the 
tester's  or  the  user's  input.  A certain  number  of  these  inputs,  say  w,  will  result  in 
operational  errors.  During  the  development  of  the  program  only  the  subset  t of  the  N 
possible  inputs  is  used  to  test  the  program.  The  model  relates  Pe,  the  probability  of 
the  error  occurring  during  the  program  use,  to  the  parameters  W,  N,  and  t.  The  model 
also  yields  a rectangular  grid  representing  the  test  process.  Each  grid  point  (j,i)  is 
interpreted  as  the  excitation  of  the  bug  i by  the  input  j.  The  graph  also  portrays  the 
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two  situations  when  (1)  a single  input  excites  several  bugs,  and  (2)  different  inputs  ex- 
cite the  same  bug.  These  two  situations  are  displayed  by  either  a horizontal  or  verti- 
cal interconnection  of  grid  points. 

A related  study  utilizes  a similar  model  to  evaluate  the  probability  that  a particular 
sequence  excites  a bug.  This  work  is  reported  in  Reference  4 (Section  3.7). 

7 . The  Exhaustive  Test  Model 

It  is  obvious  that  an  exhaustive  test  is  prohibitive  in  time  and  cost  for  almost  all 
practical  programs.  Nevertheless,  the  number  of  tests  for  an  exhaustive  test  plan 
still  represents  a useful  upper  bound  on  the  actual  number  of  tests  that  will  be  used  in 
practical  testing.  These  upper  bounds  have  use  in  comparing  the  test  efforts  needed  to 
test  different  programs. 

The  knowledge  of  the  number  of  tests  needed  for  an  exhaustive  test  is  needed  if 
one  wishes  to  use  test  coverage  (that  is,  the  number  of  the  tests  performed  to  the  num- 
ber of  possible  tests),  as  a figure  of  merit.  Because  of  these  applications,  the  model- 
ing of  an  exhaustive  test  was  studied  and  the  results  are  reported  in  References  12,  21, 

22  and  24. 

The  exact  computation  of  the  number  of  tests  was  made  in  Ref.  24,  for  a particular 
example  of  a small  program.  In  this  example,  the  program  extracted  the  roots  of  a 
quadratic  equation,  and  was  designed  in  PDP-8  assembly  language.  The  number  of  ex- 
haustive tests  depended  upon  the  range  of  possible  values  for  the  three  coefficients  of 
the  quadratic  and  the  12  bit  word  length  of  the  computer. 

E.  Complexity  Models 
1 . Introduction 

The  fundamental  problem  in  the  development  of  software  is  to  construct  a product 
of  minimum  cost  which  still  meets  the  quality  and  reliability  design  specifications. 

When  the  product  is  completed  records  often  show  that  the  development  costs  greatly 
exceed  the  initial  projection.  A major  cause  of  the  poor  estimation  is  the  strong  influ* 

• ence  of  complexity  and  the  difficulty  in  measuring  and  assessing  its  effects. 

Complexity,  to  be  more  specific,  has  a major  effect  on: 

(a)  development  costs , 

(b)  development  time, 

(c)  amount  of  testing, 

(d)  number  of  bugs, 

(e)  memory  needs, 

( f)  size  of  host  computer. 
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(g)  skill  and  number  of  programmers, 

(h)  maintenance, 

( i)  life-cycle  costs, 

and  many  others.  Complexity  is  a very  elusive  type  of  measure,  with  its  importance 
perceived  by  everyone  but  in  an  ambiguous  and  fuzzy  fashion. 

Complexity  affects  all  the  above  listed  developmental  parameters  in  a nonlinear, 
hard-to-determine  way.  Clearly  one  needs  a more  sophisticated  measure  than  a mere 
count  of  the  number  of  lines  of  code.  There  are  many  examples  of  simple  large  pro- 
grams and  complex  small  programs.  The  problem  of  complexity  is  a difficult  one  as 
is  evidenced  by  the  number  of  highly  competent  researchers  working  on  this  problem 
and  the  relatively  few  important  results  published  so  far . 

The  research  supported  under  this  contract  has  advanced  the  field  by  providing: 

(a)  A comprehensive  study  of  classical  methods  of  complexity  with  emphasis  on 
recursive  function  theory.  The  results  were  theoretically  appealing:  how- 
ever, practically  difficult  to  apply  and  extend. 

(b)  An  approach  in  which  software  complexity  is  decomposed  into  simpler  soft- 
ware components.  One  such  component  is  accessibility,  which  is  a meas- 
ure of  the  software  structure.  Another  component  is  testability,  which  is 
the  ease  with  which  a module  may  be  accessed  and  tested.  A third  compo- 
nent is  testedness,  which  measures  the  frequency  of  execution  during  the 
testing  phase.  This  approach  shows  promise  but  more  work  is  needed. 

(c)  A measure  of  complexity  based  on  Zipf's  Law  and  other  results  in  natural 
language  theory.  This  approach  links  known  results  from  natural  language 
and  information  theories  to  software  complexity  questions.  (The  paper  on 
this  subject  (Ref.  25)  has  received  the  best  paper  award  at  the  Fall  1977 
IEEE  COMPCON  conference.)  The  results  give  answers  to  such  questions 
as  the  expected  length  of  program  given  the  initial  estimate  of  the  number 

of  variables  and  the  types  of  needed  algorithms.  The  formulas  derived  from 
fundamental  principles  give  answers  which  correlate  well  with  those  obtained 
independently  through  software  science  formulas  (Reference  27). 


2.  Application  of  Recursive  Function  Theory  to  Program  Complexit 


The  initial  study  of  software  complexity  focused  on  the  theories  of  computational 
complexity,  with  the  intent  of  extending  such  theories  to  realistic  programming  prob- 
lems. Recursive  function  theory  was  selected  as  the  most  promising  candidate  for 
such  a study. 

Strictly  speaking,  recursive  function  theory  applies  solely  to  mathematical  func- 
tions. However,  the  programming  solution  to  a problem  often  can  be  viewed  as  a func- 
tion or  functional.  Thus,  the  first  question  of  interest  is  how  to  reduce  a general 
program  to  a function.  In  theory,  each  output  variable  can  be  related  to  input  variables, 
but  the  practical  problem  of  obtaining  the  mathematical  functional  relationship  is  not  an 
obvious  one.  This  is  further  complicated  in  that  classical  recursive  function  theory 
deals  with  integer  variables. 
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In  our  work  we  extended  recursive  functions  to  the  domain  of  non-integers  and 
character  strings  3 Unfortunately,  the  method  is  difficult  to  apply  and  suffers  from 
several  limitations.  Consequently,  we  conclude  that  the  method  is  of  limited  use  for 
classifying  practical  problems  in  terms  of  their  complexity. 

3 . Component  Measures  for  Evaluation  of  Software  Complexity 

One  approach  to  measuring  complexity  is  to  decompose  it  into  simpler,  easier  to 
comprehend,  components.  Such  an  approach  has  been  undertaken  and  in  fact,  some  of 
the  more  promising  measures  can  also  be  applied  to  determining  the  quality  of  software. 

The  first  component  developed  and  studied  was  named  accessibility.  Accessibility 
measures  how  easily  a module  can  be  reached  and  thus  describes  the  difficulty  in  test- 
ing such  a module  (Reference  3,  pp.  53-55).  The  second  component  was  named  testa- 
bility. This  component  used  both  accessibility  as  well  as  a measure  of  resources  re- 
quired to  test  a program  module  (Reference  3,  pp.  55-56).  The  third  component  was 
named  testedness  and  was  defined  as  a function  of  testability  and  the  frequency  of  execu- 
tion during  the  testing  phase  (Reference  3,  pp.  56-57). 

Additional  work  on  these  concepts  is  described  in  References  20  and  26.  This 
work  is  very  promising  but  more  effort  is  needed  on  obtaining  more  components  and 
correlating  them  with  the  overall  software  complexity. 

4.  Language  Theory  Measures  of  Complexity 

The  use  of  number  of  statements  as  a measure  of  complexity  is  often  used.  It  is 

recognized  that  such  a simplistic  measure  does  not  truly  gauge  the  program.  Often 

there  are  programs  with  a few  lines  considerably  more  complex  than  programs  with 

27 

many  lines.  As  a better  measure  Halstead  suggested  operator  and  operand  count. 

The  analogy  between  operators,  operands  and  verb  nouns  suggested  the  application  of 
Zipf's  Law  from  natural  languages  to  programming  languages. 

The  relationships  between  number  of  operators  and  operand  types,  as  well  as  their 
frequency  of  occurrence,  was  shown  to  follow  the  basic  Zipf's  Law.  An  extended  form 
of  Zipf's  Law  has  been  derived  which  more  closely  models  some  of  the  experimental 
data . 

The  theory  culminates  in  an  equation  which  relates  operator  and  operand  length 
9 25 

to  the  number  of  types.  ’ If  we  estimate  early  in  the  design  the  number  of  input  and 
output  variables  and  operators  (which  will  be  used  in  the  program)  we  can  obtain  an 
estimate  of  the  program  length. 

These  results  correlate  well  with  the  results  which  Halstead  has  obtained  using 
software  science  techniques  (Ref.  21,  Chapter  3 and  Reference  27). 
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F.  Other  Research  in  Progress 


1.  Introduction 

As  is  usual  in  any  research  endeavor,  there  are  a number  of  tasks  which  are  in 
different  stages  of  completion.  Some  of  these  represent  new  work,  not  reported  in 
previous  progress  summaries.  Other  work  has  been  reported  earlier,  but  has  not  as 
yet  reached  the  point  where  a comprehensive  technical  report  is  justified. 

The  brief  sections  given  below  describe  such  tasks.  These  descriptions  have 
been  included,  to  record  the  ideas  and  their  stage  of  progress,  in  order  to  provide 
continuity  in  the  follow-up  research. 

The  eleven  technical  sections  of  this  chapter  can  be  broadly  classified  into  the 
following  categories: 


1.  Models:  Sections  F.  2,  F.  3,  F.  4 

2.  Complexity:  F.  4,  F.  5 


3.  Software  Management  Tools:  F.  6,  F.  7 

4.  Data  Collection:  F.  8,  F.  9,  F.  10,  F.ll 

5.  Design  Techniques:  F.  12 


Each  of  these  sections  is  self-contained  with  appropriate  literature  references  to 
related  work.  It  is  anticipated  that  several  of  these  will  be  developed  into  comprehen- 
sive technical  reports,  while  others  will  be  incorporated  in  future  research. 

2.  Acceptance  Test  Models 


Most  of  our  research  on  testing  has  involved  the  development  phase  of  software 

production  1 1#  12,  22,  and  24  T.  • ..  i 

inis  section  describes  some  preliminary  work  which 

we  have  done  on  the  structure  of  an  acceptance  test.  Such  a test  is  one  of  the  most 
important  milestones  in  the  management  of  software  development.  An  acceptance  test 
is  basically  a vehicle  whereby  the  contractor  convinces  the  contractee  that  the  soft- 
ware is  good,  correct,  and  should  be  accepted.  The  test  which  we  have  proposed  is 
of  two  parts. 


A group  of  k test  cases  are  selected  which  test  each  feature  or  mode  of 
operation  of  the  program.  All  parameters  are  selected  well  within  the 
normal  range  of  operation. 


A collection  of  n test  cases  are  devised  such  that  they  include  all  known 
extreme  and  difficult  cases  which  constitute  n tests;  the  remaining  (n-n  ) 
cases  are  distributed  over  the  normal  range  ol  operation.  ^ 
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The  specific  test  cases  are  devised  by  the  contractee  (or  his  representative  if 
third  party  testing  is  used)  and  are  unknown  to  the  contractor.  If  all  k cases  run  suc- 
cessfully, and  at  least  x%  of  the  n test  cases,  then  the  software  is  accepted;  however, 
the  contractor  is  given  a short  amount  of  time  to  fix  the  n(l-x/l00)  errors.  If  either 
part  1 fails  cr  part  2 does  not  meet  the  required  threshold  of  x%  successful  tests  the 
contractor  is  given  time  to  improve  the  software.  After  the  software  improvement  the 
acceptance  test  (i.  e. , both  parts)  is  rerun  with  different  test  cases. 

Work  is  presently  continuing  on  these  ideas  with  regard  to  identifying  the  contrac- 
tor and  contractee  risks  and  relating  the  results  to  the  software  reliability. 


3.  Comparison  and  Test  of  Software  Reliability  Models 

Major  questions  in  the  research  and  application  of  software  reliability  models 
concerns  the  basic  assumption,  range  of  applicability,  and  ease  of  use  of  one  model 
versus  another.  A good  summary  of  the  models  proposed  in  the  past  and  their  com- 
parison is  given  in  References  28  and  29. 

Answers  to  the  above  questions  can  be  developed  in  several  ways.  One  obvious 
approach  is  the  theoretical  investigation  of  the  models,  their  underlying  assumptions, 
and  their  limitations  using  mathematical  statistics.  This  approach,  taken  by  several 
workers  (cf.  Ref.  14),  suffers  from  drawbacks.  In  some  cases  the  computations  are 
intractable.  In  other  cases  it  is  difficult  to  assess  the  practical  impact  of  the  non- 
existence of  an  entity  (e.  g. , a particular  moment)  or  the  bias  of  an  estimator. 

The  best  overall  test  of  such  models  is  to  use  them  for  the  analysis  of  sufficient 
actual  field  data  with  well-known  outcome.  If  the  predictions  by-and-large  agree  with 
the  field  experience,  we  will  be  less  concerned  with  the  mathematical  quirks.  How- 
ever, we  never  have  field  data  in  sufficient  varieties  to  relieve  our  concern. 

In  order  to  supplement  the  field  data,  we  propose  to  generate  additional  data  by 
3 0 

simulation.  We  may  fabricate  data  of  particular  types  to  pin-point  and  study  certain 
model  weaknesses.  Suppose,  for  example,  that  a reliability  model  is  suspected  of 
being  insensitive  to  changes  in  failure  rate  in  time.  We  can  generate  three  sets  of 
simulated  data;  (1)  with  constant  failure,  (2)  with  linearly  increasing  failure,  (3)  with 
exponentially  decreasing  failure.  We  can  now  study  how  the  model  responds  to  these 
three  pure  data  patterns.  Such  study  is  not  possible  with  real  data  which  contains  many 
patterns. 


4.  Application  of  Graph  Theory  to  Software  Reliability 

The  application  of  graph  theory  to  problems  in  software  reliability  was  studied. 


The  motivation  for  this  study  is  the  fact  that  linear  graphs  (i.  e. , vertices  connected 
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by  edges  in  various  configurations),  suggest  themselves  as  logical  structures  for  rep- 
resenting computer  programs.  However,  the  detailed  application  of  these  concepts 
is  not  obvious.  To  be  sure,  some  elementary  approaches  suggest  themselves  at  the 
outset,  and  the  flowchart  concept,  so  basic  to  computer  program  documentation,  is  very 
close  to  a linear  graph.  For  example,  the  vertices  of  the  graph  may  be  operational 
symbols  of  the  chart,  and  similarly,  the  graph  edges  are  the  flow  lines  of  the  chart. 
Alternatively,  the  vertices  may  represent  states  reached  in  the  program,  and  the 
edges  indicate  the  passage  between  stages  (see  Ref.  21,  pp.  3 — 7 3ff ) . Such  models  can 
be  used  to  study  the  path  structures  in  the  code.  This  in  turn  can  relate  to  the  kind  of 
testing  required  to  achieve  various  levels  of  software  reliability.  Covering  a graph 
with  paths  and  related  concepts  form  a standard  part  of  graph  theory.  The  interplay 
between  this  material  and  software  testing  algorithms  has  received  attention  from  a 
number  of  investigators. 

Graph  theory  may  contribute  to  other  aspects  of  software  reliability,  as  well  as 
forming  the  basic  representational  structure  just  described.  Several  such  aspects 
have  been  considered.  These  are  briefly  described  below  for  possible  future  use  even 
though  they  were  not  carried  beyond  the  initial  stages.  These  aspects  deal  with  com- 
plexity and  fundamental  characterization  of  software. 

As  in  the  case  of  hardware,  complexity  as  a concept  carries  both  positive  and 
negative  aspects  for  reliability.  On  the  one  hand  complex  systems  may  be  expected  to 
be  less  reliable  than  simple  ones;  on  the  other,  a major  technique  for  improving  re- 
liability is  enhancing  system  design  which  may  increase  complexity.  It  becomes 
clear  that  a conceptual  use  of  "complexity"  is  not  satisfactory,  but  specific  and  quanti- 
fiable definitions  are  required.  In  graph  theory  there  are  several  definitions  of  graph 
complexity,  the  most  well-known  being  cyclomatic  complexity  (discussed  in  Ref.  21, 
pp.  3-73ff  and  Reference  32).  Two  other  measure  of  complexity  are  well  defined.  One 

uses  the  information  content  in  a graph  and  its  underlying  automorphic  group  structure 

33 

and  was  developed  largely  by  Monshowitz.  Another  approach  to  complexity  has  been 
formulated  by  Minoli  and  is  described  in  Reference  34.  He  specifies  a set  of  desirable 
properties  for  a complexity  measure  and  defines  its  mathematical  form.  These 
measures  of  complexity  were  not  developed  with  software  applications  in  mind.  It  is 
likely  that  more  appropriate  measures  can  be  produced  by  starting  from  basic  needs  of 
software  analysis. 

As  another  approach  to  complexity,  consider  activity  on  the  linear  graph  or  net- 
work (representing  computer  software)  rather  than  the  static  (logical)  structure  alone, 

as  is  the  case  for  the  complexity  measures  mentioned  above.  One  such  approach  is 

35 

due  to  Flynn  and  is  discussed  in  detail  in  his  dissertation. 
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Though  most  people  see  at  once  a connection  between  software  code,  flow  charts, 
and  linear  graphs,  deep  consideration  exposes  a lack  of  fundamental  characterization. 
One  wishes  to  employ  mathematics  to  the  study  of  a variety  of  computer  software 
problems,  but  in  fact,  it  is  not  clear  what  kind  of  mathematical  object  represents  a 
particular  computer  software.  Indeed,  the  representation  may  differ  depending  upon 
need  and  intended  usage.  For  example,  consider  the  characterization  of  complexity. 
How  should  one  represent  the  software  and  then  impose  a goal  oriented  definition  of 
complexity  upon  the  resulting  mathematical  object?  In  any  case,  regardless  of  specific 
representations,  one  should  have  a mathematical  characterization  of  the  software. 

Study  indicates  that  such  characterization  would  be  likely  to  follow  set  theoretic  formu- 
lations. This  in  turn  leads  to  linear  graphs  which  have  a direct  correspondence  to 
subset  structures.  This  basic  characterization,  however,  has  not  been  completed  as 
yet. 

An  aid  to  the  study  of  the  interplay  between  graphs  and  code  is  the  automatic 
generation  of  linear  graphs  which  represent  computer  code.  A program  to  take 
FORTRAN  code  and  produce  a graph  from  it  was  started  using  PL/l.  The  procedure 
looks  promising  and  will  be  further  studied  in  the  future. 

5.  Correlation  of  Program  Errors  with  Complexity  Measures 

In  order  to  assess  the  validity  of  complexity  measures  advanced  in  Refs.  9 and 
27,  a data  collection  has  been  planned  using  an  IBM  370-125  computer  operating  under 
DOS.  All  output  generated  by  the  computer  is  stored  on  the  output  queue  of  the  opera- 
ting system  generated  file  POWERQ.  The  process  is  cumulative  and  continues  until 
POWERQ  is  filled,  which  takes  approximately  one  week. 

As  a next  step  the  POWERQ  output  file  will  be  run  through  a program  which 
scans  it  for  FORTRAN  programs.  When  a FORTRAN  program  is  found,  the  program 
copies  the  job  card  information,  then  continues  scanning  for  syntax  and  run  time 
errors.  These  are  identified  by  the  prefix  ILF  in  the  output  queue.  There  are  a total 
of  301  such  errors  detectable  by  the  system.  The  errors  are  copied  after  the  job  card 
information.  The  process  continues  until  the  entire  POWERQ  file  has  been  scanned. 

The  sought-after  information  is  copied  onto  a tape. 

The  resulting  tape  will  then  be  sorted  according  to  job  number,  name  and  program 
number,  which  will  provide  unique  identification  of  the  program.  After  sorting,  the 
tape  will  be  merged  with  a master  tape  on  which  the  following  information  will  be 
accumulated: 

1.  A count  of  all  runs  which  contain  syntax  errors.  Syntax  errors  are  those 
with  error  numbers  from  1 to  206. 
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2.  A count  of  all  runs  which  contain  run  time  errors.  These  are  identified  by 
error  numbers  207  to  301. 

3.  A count  of  all  error  free  runs  . It  will  be  assumed  that  the  number  of 
logical  errors  is  one  less  than  the  number  of  error  free  runs. 

4.  A total  count  of  all  runs. 

The  final  version  of  each  program  will  be  collected  from  the  students  and  read 
onto  a tape  using  the  job  card  with  which  it  was  run.  These  programs  will  then  be  run 
through  the  modified  version  of  a program  obtained  from  Professor  Halstead,  which 
counts  frequency  and  other  parameters. 

Using  ‘he  frequency  counts,  the  entropy  measure  (i.  e.  , f.log  p.)  will  be  ob- 
• 1 1 “ 1 
tamed  and  stored,  together  with  job  card  information,  on  a tape.  The  probabilities 

p^  will  be  obtained  from: 

1.  previously  run  programs 

2.  the  current  programs  which  are  being  tested. 

The  tape  so  generated  will  eventually  be  sorted.  This  tape,  containing  the  com- 
plexity measure,  will  then  be  compared  to  the  error  counts,  and  a correlation  deter- 
mined, if  one  exists. 

Most  of  the  programming  effort  to  date  has  been  spent  on  the  program  needed  to 
handle  the  POWERQ  file.  Methods  were  studied  to  access  and  process  this  file.  The 
program  is  now  almost  ready  and  only  minor  changes  remain.  Further  work  will  be 
described  in  the  future. 

6.  Choice  of  Strategy  in  Software  Revision 

For  a wide  variety  of  reasons,  software  undergoes  changes  and  revisions  during 
its  life  cycle.  Examples  of  such  reasons  are: 

A new  or  changed  specification  has  to  be  incorporated. 

The  host  computer  or  source  language  for  the  software  has  been  changed. 
Peripheral  equipment  has  been  changed. 

Significant  bugs  are  found  in  deployed  software. 

Format  changes  in  input  or  output  are  required. 

The  program  is  to  be  used  as  a module  in  different  software  and  the  inter- 
faces must  be  matched. 

When  confronted  with  such  changes,  the  first  decision  concerns  the  choice  among 
the  following  approaches: 

1.  should  the  code  be  discarded  and  a new  code  written 

2.  should  the  code  be  retained,  but  major  modification  be  rrade. 


1. 

2. 

3. 

4. 

5. 

6. 
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3.  should  only  minor  modifications  be  made. 

The  choice  among  these  three  alternatives  is  predicated  upon:  the  quality  of  the 
software  under  consideration,  the  extent  and  signficance  of  the  required  change,  the 
expected  life-time  of  the  new  software,  and  the  budgeted  resources.  The  selection  of  al- 
ternatives has  to  be  made  on  a technical  and  managerial  basis.  At  present  there  are 
few  objective  tools  to  aid  in  such  decisions. 

3 6> 

Early  work  in  this  area  by  S.  Amster  correlated  various  subjective  and  objec- 
tive measures  of  program  quality.  The  subjective  measures  elicited  opinions  on  the 
extent  of  changes  needed  to  improve  clarity,  decrease  memory  size,  and  decrease 
run-time.  The  objective  measured  included  such  factors  as  number  of  lines  of  code, 
number  of  GoTo's,  number  of  calls,  and  so  on. 

More  recently  M.  Halstead  (Ref.  27,  Chapter  7)  has  classified  several  so-called 
program  "impurities,  " which  detract  from  clarity.  The  elimination  of  these  impurities 
improves  the  program  readability. 

We  are  investigating  the  use  of  Amster's  and  Halstead's  techniques,  as  well  as 
others,  to  produce  a ranking  to  aid  the  designer  and  the  manager  in  their  decision. 

7.  Effect  of  Organizational  Structure  on  Software  Development 

On  all  large  technological  tasks,  the  organizational  structure  of  the  project  team 

has  a large  influence  on  productivity,  reliability,  and  the  quality  o(  the  product.  These 

39 

effects  are  especially  strongly  exhibited  in  software  development.  A major  pheno- 
menon of  interest  is  that  productivity  is  often  not  proportional  to  man  hours.  This 
fact  has  been  observed  by  many  and  articulated  by  Brooks.  In  fact,  at  some  stages 
of  software  development,  adding  workers  to  the  project  slows  the  pace  of  progress. 

We  have  initiated  research  in  this  area  by  studying  the  research  literature  on 
graph  theoretic  models  of  organizational  behavior.  Much  of  the  qualitative  literature 
relates  to  research  done  by  psychologists  prior  to  1950.  This  work,  which  supplies 
graph  structures  for  various  group  organizational  structures,  is  summarized  in  Refer- 
ence 38. 

A simplistic  assumption  is  usually  made  that  programming  productivity  is  a 
direct  function  of  charged  time.  However,  this  is  not  a satisfactory  assumption  because 
charged  time  represents  raw  man  hours  composed  of  personal  time  (coffee  breaks, 
conversations,  etc.  ),  studying  time,  communication  time,  and  lastly,  productive  time. 
We  can  assume  that  the  proportion  of  personal  time  is  fixed  at  say  10%  regardless  of 
the  organizational  structure.  However,  the  remaining  time  divisions  are  highly  depen- 
dent  on  the  organizational  structure.  Preliminary  results  indicate  that  we  are  able 
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to  model  the  communication  links  as  paths  in  the  graph  structure,  and  the  constraints 
(a  fixed  number  of  man  hours  or  a fixed  number  of  programmers).  The  object  is  to 
evaluate  the  merits  of  various  organization  schemes  sc  as  to  minimize  the  need  for 
communication  and  thus  increase  the  productivity. 


A simple  example  illustrates  the  phenomenon  of  increased  man  hours  resulting 
in  decreased  productivity.  Suppose  10  men  work  on  a project,  and  their  total  of  400 
hours  per  week  are  spent  on  40  hours  of  personal  time,  160  hours  of  studying  and  com- 
munications time,  and  200  hours  of  productive  time.  A programmer  is  added  to  the 
group,  and  since  he  is  new,  he  spends  his  40  hours  on  4 hours  of  personal  time,  16 
hours  of  productive  time,  and  20  hours  on  studying  and  communications.  Thus,  he  adds 
16  hours  of  productive  time  to  the  task.  However,  suppose  that  the  integration  of  this 
new  man  into  the  group  requires  1 hour  review  of  the  project  attended  by  all.  This  is  a 
group  loss  of  10  hours  of  productive  time.  In  addition,  the  new  man  is  assigned  to 
spend  a day  (8  hours)  with  an  experienced  team  member  to  help  him  get  started  on  his 
assigned  task.  Thus,  another  8 hours  are  lost,  and  the  net  result  of  this  added  worker 
is  a two  hour  loss. 


Work  is  presently  continuing  on  establishing  further  relationships  among  the 
organizational  structure  and  productivity,  reliability  and  quality,  and  of  designing  a 
realistic  model  of  this  task. 


8.  Collection,  Storage,  and  Retrieval  of  Software  Reliability  Data 

The  collection,  storage,  and  availability  of  reliability  data  is  of  vital  importance 
to  the  worker  in  the  field  of  software  reliability.  The  data  is  needed  by  experimental- 
ists to  suggest  models.  It  is  needed  by  theoreticians  to  test  the  hypothesis  of  their 
postulated  models  and  to  evaluate  their  parameters.  Finally,  it  is  needed  by  managers 
and  designers  as  a data  base  for  design  decisions. 

3 1 

A preliminary  study  supported  by  RADC  discusses  the  scope,  specific  tech- 
niques, and  feasibility  of  establishing  a software  reliability  data  base.  In  the  course 

21 

of  writing  material  on  software  engineering,  the  following  rough  estimates  were  made 
on  the  amount  of  information  generated  annually  in  writing  software  for  the  Air  Force. 

a.  The  total  number  of  bugs/year  is  roughly  between  7.  5 * 10^  and  7.  5 * 10  . 

b.  Assuming  that  ten  data^descriptors /bug  must  be  stored,  one  needs  between 
7.  5 * 10  and  7.  5 * 10  words  of  storage.  All  these  words  may  be  stored 
on  a few  reels  of  tape  and  disks. 

c.  Assuming  microfiche  is  used.for  permanent  backup  storage,  one  needs  be- 
tween 7.5*  10^  and  7.  5 * 10  microfiche. 

Further  details  and  the  underlying  assumptions  are  given  in  Reference  21. 
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9.  Classification  and  Enumeration  of  Program  Errors 

In  order  to  obtain  some  prelimim  ry  quantitative  data  on  the  various  types  of  pro- 
gramming errors,  approximately  1,  000  programs  were  analyzed  manually.  In  this 
work,  described  in  Ref.  1,  we  classified  the  errors  into  three  broad  categories, 
namely,  syntax,  semantic,  and  algorithmic.  Within  these  three  categories,  we  enumer- 
ated 117  different  errors.  Rather  than  listing  individual  numbers  of  occurrences,  we 
rated  the  errors  on  the  scale  from  1 (typifying  a rare  occurrence)  to  5 (for  a frequent 
occurrence). 

The  desired  statistics  were  obtained  from  programs  in  a first  and  in  a second 
programming  course  for  the  following  reasons. 

1.  The  student  programs  were  under  our  control. 

2.  In  a typical  programming  assignment  given  to  a class  of  25  students,  the 
same  program  is  run  approximately  50-100  times  (about  2-4  runs  per 
student,  with  the  extra  runs  for  debugging  or  cosmetic  improvements). 

Since  the  assignments  are  known,  we  know  the  results  and  can  note  the 
presence  of  algorithmic  errors,  a task  that  will  be  impossible  in  a "strange" 
program. 

3.  The  student  programs  are  a good  sample  because  the  number  of  errors  is 
nearly  constant  in  a programming  assignment.  This  is  so  because  at  the 
start  of  the  semester,  even  though  the  students  are  inexperienced,  the 
assignments  are  simple  and  have  a few  lines  of  code.  Later  in  the  semes- 
ter the  students  are  more  experienced  and  make  fewer  errors  per  line  of 
code,  but  the  assignments  are  harder  and  require  more  lines  of  code. 

We  tried  to  gather  various  statistics  on  programming  errors  by  issuing  forms  to 
students  and  asking  them  to  record  various  program  bugs.  However,  this  me  thod  was 
not  successful.  Some  students  were  suspicious  that  these  forms  were  used  for  grading 
purposes.  Other  students  did  not  want  to  admit  to  a high  number  of  errors  or  runs. 
Consequently,  we  abandoned  this  voluntary  method.  What  we  did  instead  was  to  modify 
the  output  for  the  student  programs  so  as  to  produce  tvo  identical  printouts  of  each 
program.  One  was  returned  to  the  student,  and  the  other  was  retained  for  us  in  the 
computer  center.  We  collected  these  duplicate  programs  and  used  them  in  our  analysis. 
This  duplication  was  made  possible  by  the  fact  that  the  students'  programs,  written  in 
the  PL/I  language,  are  compiled  with  the  PLAGO  interpreter  (which  implements  a sub- 
set of  PL/I)  developed  here  and  completely  under  our  control. 

To  compile  the  statistics  we  used  the  form  described  in  Reference  1.  Such  forms 
were  compiled  for  each  student.  From  these  forms  we  enumerated  the  number  of 
different  errors.  For  details  we  again  refer  to  Reference  1. 

The  tediousness  of  the  manual  analysis  and  the  large  volume  of  programs  to  be  re- 
corded lead  to  plans  for  automatic  collection.  Two  such  plans  are  described  in  the  next 
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two  sections. 

10.  Modifications  of  the  PLAGO  Interpreter 

The  manual  classification  of  compile  and  execution  time  errors  was  discussed  in 
the  previous  section.  This  work  indicated  the  need  for  automatic  collection  for  the 
following  two  main  reasons: 

1.  The  volume  of  errors  was  too  large  for  manual  collection  and  reduction. 

2.  The  students'  reporting  data  was  not  always  reliable.  These  forms  often 
reported  fewer  errors  than  actually  occurred,  because  they  thought  that 
such  good  results  would  please  the  instructor. 

Consequently  an  automatic  collection  scheme  was  undertaken.  The  purpose  of 
the  collection  was  to  obtain  the  following  information: 

1.  The  number  of  PLAGO  programs  submitted.  PLAGO  is  a PL/I  dialect  con- 
taining a scientific  subset  of  PL/I,  developed  at  the  Polytechnic  and  used  in 
our  programming  courses.  PLAGO  was  chosen  as  the  subject  language, 
because  it  is  entirely  in  our  control. 

2.  The  number  of  errors  in  each  category.  Such  recording  is  possible,  be- 
cause each  category  is  identified  by  an  error  number.  These  categories 
include  both  compile  and  execution  time  errors. 

3.  Certain  logical  errors  occurring  during  execution.  This  can  be  done  by 
requiring  students  to  use  prescribed  variable  names  in  programming  as- 
signments and  recording  the  results  which  are  different  from  the  correct 
ones. 

All  these  tasks  were  undertaken  as  a Master  Thesis  by  a student.  Unfortunately, 
he  took  a leave  of  absence  which  continues  at  this  time. 

A similar  effort,  using  the  FORTRAN  compiler,  is  described  in  Section  F.  5. 
Nevertheless,  we  hope  to  continue  PLAGO  error  collection  in  the  future  either  with 
the  returning  student  or  with  other  personnel. 

1 1 . Small  Scale  Tests 

In  order  to  study,  verify,  and  measure  parameters  of  the  theoretical  models 
8 9 14 

developed,  ’ ’ small  scale  tests  were  planned  and  begun.  The  tests  consisted  of 

four  tasks,  programmed  by  student  programmers,  with  careful  records  kept  on  their 
debugging  experience.  The  four  tasks  and  the  reporting  form  are  described  in  Refer- 
ence 5 (pp.  22-31). 

The  student  programmers  were  of  sophomore-junior  standing,  with  high  interest 
and  ability  in  programming  topics.  Because  of  this  selectivity  we  believe  their  pro- 
duct to  be  equivalent  to  programs  produced  by  programmers  with  intermediate  exper- 
ience. Consequently,  we  consider  the  obtained  test  data  to  be  representative  of 
normal  practice. 
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The  programmers  were  made  aware  of  the  importance  of  maintaning  careful  and 
truthful  records.  They  were  also  given  the  following  specific  instructions: 

1.  The  problems  were  to  be  analyzed  and  coded,  with  both  analysis  and  coding 
times  recorded. 

2.  The  initial  program  was  to  be  corrected  of  just  the  syntax  errors.  Their 
number,  number  of  runs  needed  for  their  correction,  and  run  times  were 
to  be  recorded.  All  printouts  were  to  be  saved  and  numbered. 

3.  The  programs  were  presented  to  us  (i.  e. , to  either  M.  Shooman  or  H. 
Ruston)  for  inspection.  We  then  asked  the  program  author  and  other 
members  of  the  group  to  debug  each  copy  independently,  recording: 

a.  Number  of  bugs  and  types  found  in  each  debugging  shot. 

b.  Analysis  time  and  computer  time  for  each  debugging  shot. 

c.  History  of  removed  bugs  and  generated  bugs. 

4.  A programmer  who  reached  a blind  alley  had  to  consult  with  us.  He  could 
neither  ask  for  other  help,  nor  abort  the  program. 

5.  The  programs  were  constructed  with  the  following  constraints: 

a.  Structured  design  was  to  be  followed. 

b.  The  instructions  were  to  contain  no  impurities  (Reference  27). 

c.  The  main  program  was  to  be  the  control  structure,  with  cdls  to 
to  modules  (i.  e.  , blocks  or  procedures)  for  various  tasks. 

d.  No  module  was  to  exceed  50  lines. 

The  first  three  programs  were  written  by  five  different  student  programmers. 
Most  have  been  completely  debugged  and  documented.  We  must  still  debug  the  remain- 
ing programs,  record  data  on  the  fourth  program,  and  analyze  the  data. 

12.  Automatic  Programming  Techniques 

This  study  addresses  itself  to  three  questions,  namely, 

a.  Can  a better  applications  program  design  be  obtaine  d if  certain  program 
blocks  are  available  in  a computer  library?  (By  better  we  mean  all  the 
goodness  factors  of  a design  - less  cost,  more  reliability,  etc.,  however, 
we  do  not  necessarily  count  run  time  or  memory  size  as  long  as  they  are 
within  reasons). 

b.  If  the  answer  to  the  first  question  is  in  the  affirmative,  what  blocks  should 
be  written  and  placed  in  the  library, 

and, 

c.  how  is  the  information  about  these  blocks  to  be  transmitted  to  the  program 
designer? 

In  the  research  to  be  described,  we  assume  that  the  answers  to  these  three 
questions  are: 

a.  If  program  blocks  are  already  available,  the  incorporation  of  these  blocks 
certainly  simplifies  the  design  task.  This  statement  is  evidenced  by  the 
fact  that  to  design  around  availalbe  code  is  the  standard  design  technique. 
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A further  advantage  of  reusing  available  code  is  that  inexperienced  junior 
programmers  may  produce  reliable  software,  if  critical  parts  have  been 
designed  by  better  or  more  experienced  programmers.  Also,  many  ap- 
plications programs  are  written  by  personnel  more  experienced  with  the 
application  than  with  programming  techniques,  and  such  a system  might 
significantly  improve  the  quality  of  their  programs. 

b.  The  question  regarding  block  contents  can  only  be  answered  in  a general 
way.  If  we  accept  the  hypothesis  that  there  exists  a high  degree  of  common- 
ality in  classes  of  commercial  and  scientific  applications,  then  a set  of 
blocks  can  be  defined  whose  members  appear  frequently  in  most  designs 

for  such  classes  of  applications.  Clearly,  different  blocks  are  needed  for 
those  writing  payroll  software  than  for  the  designers  of  a wind  tunnel  simu- 
lation. Hence  blocks  must  be  grouped  in  certain  sets  which  form  a library. 
Each  library  applies  to  related  applications. 

c.  The  mode  of  transmittal  of  the  library  is  a crucial  one.  A listing  of 
available  programs  in  some  volume  is  of  little  use  to  an  inexperienced 
programmer,  because  such  programmer  may  not  know,  for  example, 
that  he  can  utilize  program  #123  to  advantage  in  his  design.  Just  the  title 
of  such  a program  (say  Gram -Charlier  interpolation)  may  impact  no  mean- 
ing to  our  designer.  Even  if  this  program  has  a short  description  of  its  use, 
often  such  description  is  for  the  practitioner  rather  than  for  the  tyro. 

In  view  of  these  questions  and  the  stated  answers,  a technique  has  been  investi- 
gated, called  here  automatic  programming.  Our  implementation  of  this  technique  is  an 
interactive  terminal  system  which  allows  on-line  queries. 

This  work,  described  in  Refs.  3,  4,  and  5,  consists  of  two  major  units,  the 
programs  FLOW  and  AUTO.  The  program  FLOW  uses  four  types  of  blocks  to  generate 
a flowchart  of  the  program.  The  blocks  are: 

1.  The  Control  Block.  This  is  a conditional  decision  block  and  is  written  by 
the  programmer. 

2.  The  Functional  Block.  This  is  the  library  block  to  be  used  in  the  design. 

3.  The  Stop  Block.  This  block  signals  the  end  of  the  path  and  is  coded  by  the 
STOP  statement. 

4.  The  User's  Code  Block.  This  block  contains  the  code  written  by  the  pro- 
grammer, exclusively  for  the  application  at  hand. 

Upon  completion  of  the  flowchart  generation,  the  control  passes  to  the  second 
unit,  that  is,  to  the  program  AUTO. 

The  purpose  of  the  program  AUTO  is  to  aid  the  program  designer  in  the  choice 
of  the  library  member.  If  there  are  several  such  suitable  library  programs,  AUTO 
will  help  in  the  selection  of  the  most  appropriate  one. 

How  is  this  done?  The  user  will  specify  to  AUTO  what  he  likes  to  do,  and  AUTO 
will  advise  which  of  the  available  programs  do  this.  AUTO  will  also  tell  the  character- 
istics of  each  method  (e.g.,  will  tell  that  trapezoidal  formula  approximates  each  pair 
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of  points  of  the  function  by  straight  line  segments)  in  response  to  each  query. 

Further  work  on  this  method  shall  be  reported  in  a future  progress  report. 

M.  L.  Shooman  and  H.  Ruston 
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NOTES  ON  n- DIMENSIONAL  SYSTEM  THEORY 
D.  C.  Youla  and  G.  Gnavi 

This  report  makes  three  observations  with  regard  to  several  issues  of  a funda- 
mental nature  that  apparently  must  arise  in  any  general  theory  of  linear  n -dimensional 
systems.  It  is  shown,  by  means  of  three  specific  interrelated  counterexamples,  that 
certain  decomposition  techniques  which  have  proven  to  be  basic  for  n=  1 and  2 are  no 
longer  applicable  for  n>  3.  In  fact,  for  n >3,  at  least  three  equally  meaningful  but 
inequivalent  notions  of  polynomial  coprimeness  emerge,  namely,  zero-coprimeness 
(ZC),  minor-coprimeness  (MC)  and  factor -coprimenes s (FC).  Theorems  1 and  3 
clarify  the  differences  (and  similarities)  between  these  concepts,  and  Theorem  2 gives 
the  ZC  and  MC  properties  a useful  system  formulation.  (Unfortunately,  FC,  which  in 
our  opinion  is  destined  to  play  a major  role,  has  thus  far  eluded  the  same  kind  of 
characterization.)  Theorem  4 reveals  that  the  structure  of  2-variable  elementary 
polynomial  matrices  is  completely  captured  by  the  ZC  concept.  However,  there  is 
reason  to  believe  that  ZC  is  insufficient  for  n>3  but  a counterexample  is  not  at  hand. 
The  matter  is  therefore  unresolved. 

A.  Introduction 

This  report  contains  three  observations  with  regard  to  the  structure  of  n- 
dimensional  linear  systems  plus  a certain  amount  of  peripheral  discussion  concerning 
their  implications  for  System  Theory  in  general. 

In  the  first,  O.,  we  show  by  means  of  a specific  counterexample  that  the  recent 
1 1 

important  result  due  to  Morf,  Levy  and  Kung  on  the  feasibility  of  primitive  factoriza- 
tion for  polynomial  matrices  in  two  variables  does  not  generalize  to  three  or  more 
variables  (n>  3). 

In  the  second,  O^,  we  explore  several  possibilities  for  extending  the  useful  con- 
cept of  coprimeness  of  1 -variable  polynomial  matrices  to  the  n-dimensional  case.  It 
is  found  that  from  the  point  of  view  of  synthesis,  the  situation  changes  profoundly  for 
n^  3 and  that  the  theory  is  beset  with  many  of  the  same  difficulties  encountered  in 
algebraic  geometry. 

In  the  third,  O^,  we  illustrate  the  true  importance  of  the  ZC  concept  by  employ- 
ing it  to  give  a complete  characterization  of  2-variable  elementary  polynomial  matrices 
and  a partial  characterization  for  n>  3. 

Oj . Let  A(z  i , z^,  • • • , z^)  s A(z)  denote  an  mxm  polynomial  matrix  in  the  n 
variables  z i=  1 -♦n,  and  let  d(z)  = det  A(z)  ^ 0.  Suppose  that  d(z)  = dj(z)d^(z)  where 

5 

det  A(z)  = determinant  of  A(z). 
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dj(z)  and  d^(z)  are  both  polynomials.  Then,  for  n^3  it  is  not  always  possible  to  find 
two  polynomial  mxm  matrices  A^(z)  and  A^(z)  such  that  det  A^(z)  = d^z),  i = 1,  2 and 


A(z)  = Aj(z)A2(z)  . (1) 

Proof  (By  counterexample).  Let^  denote  the  collection  of  all  polynomials 

3 4 5 

f(z^,z2,z-j)  in  three  variables  such  that  f(t  , t , t ) = 0.  Clearly,  9 contains 

, 2 , 3,22 

fl  = z2  ' Z1Z3  ; f2  = Z2Z3  - Z1  ; f3  = z3  ‘ Z1  Z2 

Actually,  f ^ , f^  and  f^  generated  in  the  sense  that  9-  (fj.f^.f^),  the  set  of  all  linear 
combinations 

3 

f(z)  = ? a.(z)f.(z)  (3) 

i=  1 

with  polynomial  coefficients  a ^(z),  a2(z),  a^(z). 

Indeed,  any  polynomial  g(z  z^,  z^)  admits  a decomposition 

2 

g(zj»z2,z3>  = zia(z3^  + zlz2b^z3^  + zlc^z3^  + z2d^z3^  + e^z3^  + f(Zj.z2,z3)  <4) 

where  feffj.f^.fj)  and  a,b,c,d  and  e are  polynomials  in  the  single  variable  z,.  For 

f j* 

observe,  that  g is  a constant -coefficient  sum  of  monomials  of  generic  type  Zj  z^  z^ 

and  to  establish  Eq.  (4)  it  is  therefore  only  necessary  to  establish  it  for  every  poly- 
f k 

nomial  z j z^.  Clearly,  the  result  is  true  for  i = k=  0 and  we  proceed  by  induction  on 
both  l and  k. 

Let  us  write  g - h to  indicate  that  g(z^,  z^,  z^)  is  congruent  to  h(Zj,  z^,  z^) 

# i k. 

mod(fj,  f^,  f ) and  suppose  that  z*z2  admits  the  expansion  of  Equation  (4).  Then, 

z{+1  z^  must  be  congruent  to  a sum  of  five  polynomials  of  the  type  z^  a(z^),  z^z^bfZj), 

2 3 2 2 2 

ZjC(z3),  ZjZ.,d(z3),  z je(z^).  However,  from  (2),  Zj-z^z^,  z^z^-z^,  z2-ZjZ^and 

z^Zj-z^Zj.  Hence,  z|  a(z^)  - z^z^a(zj)  and  z^b(z^)  - z^b(z^).  Similarly,  z|z^+^ 

2 2 

is  congruent  to  a sum  of  five  polynomials  of  the  type  z^  z^a(z^),  z^  z2b(z3),  ZjZ2c(z^), 

2 2 2 2 2 2 
z2  d(z3> , z2e(z3).  But  Zj  z2a(z3)  - z3a(z3),  z j z2b(z3)  - z ^ z3b(z3)  and  z^z^)  - z j z3d(z3) 

so  that  in  both  cases  the  result  is  again  of  the  form  Equation  (4).  Thus,  Eq.  (4)  is 

verified. 

Tx 


In  other  words,  the  difference  g-h  e (f  ^ , f2 , f3) . 
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Now  assume  that  ge  9.  Then, 

0 = g(t3,  t4,  t5)  = t6a(t5)  + t?b(t5)  + t3c(t5)  + t4d(t5)  + e(t5)  . 


(5) 


Since  for  arbitrary  choices  of  integers  m,  n,  p,  q and  r 

6 + 5m  / 7 + 5n  / 3 + 5p  / 4 + 5q  / 5r  , (6) 

any  cancellation  of  terms  between  pairs  of  polynomials  in  Eq.  (5)  is  impossible  whence, 

a(t3)  = b(t5)  = c(t5)  = d(t5)  = e(t5)  = 0 . 


Consequently,  a(z3)  = b(z3)  = c(z3)  = d(z3)  = e(z3)  h 0 and  gUj,  z2>  z3) 
f(z  j , z2>  z3)  e (f  ^ , f^,  f'3),  as  was  to  be  shown 

By  direct  calculation, 


Let 


,2  , , ,5  - 2 x 3^3. 

lZ  ~ fl£3  = zl(zl  ' 3zlz2z3  + zlz2+  Z3> 


E(z  j , z2, z3)  = 


(7) 


f 

f. 

2 

1 

f3 

*7 

(8) 


and  assume  that  F = F j F2  where  Fj(Zj,z2>z3)  and  F^z^z^z^  are  2x2  polynomial 
matrices , 


and 


det  F ^ = d j = z j 


, , „ . 5 2 , 3,3 

det  F2  = d2=zl  - 3Z1Z2Z3  + Z1Z2+  z3 


(9) 


(10) 


*#  345  .345  3453, 

Clearly,  F(t  , t , t ) = 0^  implies  that  F2(t  , t , t ) s 0^  since  det  F (t  , t , t ) = t ^ 0. 

This  means  that  all  four  entries  in  F7  belong  to  from  which  we  conclude  that 
2 Z 

d2=  de\.F^c9  . But  from  what  has  already  been  shown,  9-  (f^,f2,f3)  and  it  is  evident 
that 


2,2  322.2 

9 = (z2  - z,z„z,z,  - z,,z,  - z,z,) 


(ID 


T31  2 3 “T  ~3  “1“2' 

cannot  contain  any  polynomial  whose  expansion  into  monomials  exhibits  a term  of  degree 


detF  = f2-f1f3  = d1d2. 


lm  is  the  mxm  identity  matrix  and  0,^  r.  0 denote  the  mxr  and  mxm  zero 
matrices,  respectively. 
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less  than  four.  The  factorization  F = F F^,  as  described,  is  therefore  impossible, 
Q.E.D. 

Comment.  The  failure  of  deterrninantal  factorization  for  n>  3 entails  some 
perplexing  consequences,  at  least  from  the  circuit  theorist's  point  of  view.  As  an 
illustration,  consider  an  mxq  polynomial  matrix  A(z)  whose  normal  rank  r is  less 
than  either  m or  q.  Then,  in  general,  it  is  no  longer  possible  to  find  two  poly- 
nomial matrices  A^(z)  and  A^(z)  of  respective  sizes  mxr  and  rxq  such  that 

A(z)  = A1(z)A2(z)  . (12) 


In  fact,  it  is  easy  to  construct  such  an  example. 
Let 


A(z)  = 


f- 

f,  i 

0 

0 

2 

1 

F(z) 

f. 

f 

do 

do 

3 

2 

2 

2 

0 

dl 

° dl 

f2 

(13) 


where  fj.f^.f^  are  given  in  Eq.  (2)  and  d^,d2  are  defined  by  Equations  (9)  and  (10). 
Then,  det  F = djd^  and  det  A(z)  = 0 so  that  normal  rank  A(z)  = 2.  Suppose  that 


A(z) 


a b 


(14) 


where  F^,  F^  are  2x2  polynomial  matrices  and  a(z),  b(z),  c(z),  d(z)  are  four  scalar 
polynomials.  From  Equations  (13)  and  (14), 


F = FjF^  ; ac  + bd  = f 


(15) 


According  to  the  counterexample  (and  the  ir reducibility  of  both  d^  and  d^),  either 
Fj(z)  or  F2(z)  must  be  elementary.  Let  it  be  F.,(z).  Then,  without  loss  of  generality 
we  can  assume  that  F2(z)  = 1^  and  this  choice  yields  a(z)  s 0,  b(z)  = d^(z)  and 
dj(z)d(z)  = f2(z).  But  = zi  does  not  divide  f^(z)  = z^z^  " z\ • Similarly,  if  F^(z) 

is  elementary,  d2(z)  must  divide  f2(z)»  another  contradiction,  Q.E.D. 

5 ” 

The  authors  have  constructed  several  other  counterexamples  but  the  one  given 
above  possesses  an  appealing  amount  of  structure.  (The  polynomials  fi,f2>l3  have 
already  been  used  in  algebraic  geometry  to  give  a concrete  example  of  a prime  ideal 
9 = (f i » f 2 > ^3^  w^ose  square  is  not  primary.  ^ It  follows,  that  in  any  polynomial 
decomposition  of  F(z)  into  a product  of  two  square  matrices,  at  least  one  of  the  factors 
must  be  elementary;  i.e.  , must  possess  a determinant  which  is  a nonzero  constant. 

A(z)  has  normal  rank  r if  the  largest  size  minor  that  does  not  vanish  identically  is  rxr. 


1 


; 
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On  the  Coprimeness  of  Polynomial  Matrices 


Experience  with  1 -variable  synthesis  procedures  appears  to  reinforce  the  con- 
viction that  rank  degeneracies  can  always  be  eliminated,  a priori,  by  the  use  of  some 
suitable  preamble  which  extracts  superfluous  elements.  In  this  process,  the  avail- 
ability of  a rank-reducing  polynomial  factorization  scheme  such  as  Eq.  (12)  often  plays 
an  important  role.  However,  since  such  a technique  is  no  longer  applicable  for  n > 3, 
it  is  necessary  to  study  the  entire  matter  in  greater  depth  and  this  we  proceed  to  do  in 
our  second  observation. 


C>2’  Ihe  subtlety  of  the  n-dimensional  problem  becomes  quickly  apparent  when 
one  attempts  to  frame  a notion  of  polynomial  coprimeness  which  is  all-inclusive.  A 
little  thought  reveals  that  there  are  at  least  three  possible  meaningful  definitions  which 
although  equivalent  for  n = 1 and  partly  equivalent  for  n=  2,  are  completely  inequivalent 
for  n>3.  (This  divergence  is  undoubtedly  responsible  for  much  of  the  complexity 
which  characterizes  the  theory  of  contact  of  algebraic  curves.) 

Definitions . Let  A(z)  and  B(z)  denote,  respectively , an  mxq  and  mxf  polynomial 
matrix,  q+f  >m^l,  and  let 

C(z)  = [ A(z)  |B(z)]  . (16) 

Then,  1)  the  pair  A(z),  B(z)  is  said  to  be  zero  left-coprime  (ZLC)  if  there  exists  no 
n- tuple  z = (z  ^ , z^ , . . . , z^)  which  is  a zero  of  all  the  mxm  minors  of  C(z),  2)  minor 
left-coprime  (MLC)  if  these  minors  are  relatively  prime,  and  3)  factor  left-coprime 
(FLC)  if  in  any  polynomial  decomposition  C(z)  = Cj(z)C^(z)  in  which  Cj(z)  is  square, 
Cj(z)  is  necessarily  elementary.  (In  dual  fashion,  A(z)  and  B(z)  are  zero  right-coprime 
(ZRC),  etc.  , if  the  matrix  transposed  pair  A'(z),  B^z)  iz  zero  left-coprime,  etcetera.) 

Theorem  1 . For  n = 1 the  three  definitions  are  equivalent;  i.e.,  ZLC  s MLC  = 
FLC.  For  n=  2,  ZLC  4 MLC  = FLC  and  for  n=  3,  ZLC  4 MLC  4 FLC.  Always, 

Z LC  -*  MLC  ->FLC. 

4 

Proof.  That  the  three  definitions  are  equivalent  for  n=  1 is  well-known  and  is 
directly  attributable  to  the  fact  that  every  ideal  of  polynomials  in  one  variable  is 
principal.^  Since  the  polynomials  A(z)=z^  and  B(z)  = z^  possess  the  common  zero 
z=(0,0)  but  are  relatively  prime,  the  ZLC  and  MLC  concepts  differ  for  n>2.  Never- 
theless, it  is  proved  in  Theorem  3 that  for  n=  2,  a pair  A(z),  B(z)  is  MLC  iff  it  is 
FLC.  For  n=3  the  situation  changes  drastically. 


One  or  more  polynomials  are  said  to  be  relatively  prime  if  their  greatest  common 


polynomial  divisor  (g.c.d.)  is  a nonzero  constant. 
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: 


Consider  the  pair  A(z)  = F(z)  and  B(z)  = where  F(z)  is  once  again  defined 

by  Equation  (8).  It  is  easily  seen  that  the  greatest  common  divisor  (g.c.d.)  of  the 
six  2x2  minors  of 


C(z) 


[F(z) 


!1  *2^ 


(17) 


is  precisely  Zj(=d^).  Thus,  the  two  matrices  F(z)  and  z^l2  are  neither  Z LC  nor  MLC 
but,  as  we  now  show,  they  are  FLC.  Indeed,  suppose  that  C(z)  admits  a polynomial 
decom  position 


C(z)  = C1(z)[C21(z)|C22(z)] 


(18) 


where  Cj  and  C2J  are  both  2x2.  Then,  F = C jC^  and  either  Cj  or  C2J  must  be  ele- 
mentary. If  C,.  is  elementary,  detC.  = constant  . det  F = constant  • z.d_  and  from 
Li  5*2  33  1 c 

Cauch-Binet  it  follows  that  d2  = z^  - Iz^z^^  + ZjZ2  + z ^ must  divide  the  g.c.d.  Zj,  a 

contradiction.  Hence,  C^(z)  must  be  elementary  and  the  pair  F(z),  z^l2  *s  FLC.  In 
summary,  the  three  properties  are  inequivalent  for  n^3.  Furthermore,  it  is  obvious 
that  foralln>l,  Z LC  -*  MLC  ->  F LC,  Q.  E.  D. 

Theorem  2.  1)  the  mxq  and  mxf  polynomial  matrices  A(z)  and  B(z),  q+f  >m  > 1 , 
are  ZLC  iff  there  exist  two  polynomial  matrices  X(z)  and  Y(z)  such  that 

A(z)X(z)  + B(z)Y(z)  = lm  . (19) 

2)  They  are  MLC  iff  for  every  i = 1 -»n,  there  exist  polynomial  matrices  X.(z) 
and  Y.(z)  such  that 

l 

A(z)X.(z)  + B(z)Y.(z)  = *.(z)lm  , (20) 

where  v)j.(z)  is  a nontrivial  scalar  polynomial  independent  of  the  variable  z..  Moreover, 
if  A(z)  and  B(z)  are  real,  X(z),  Y(z)  and  X.(z),  Y.(z),  4/.(z),  i = 1 -*n,  can  always  be 
constructed  real. 

Proof.  1)  Clearly,  Eq.  (19)  guarantees  that  rank  C = rank  [A|  B ] =m  for_all  z and 

this  implies  that  n£  z - ( z j » z 2 zn)  is  a cornrnon  zero  a11  the  mxm  minors  of 

C(z).  Thus  Eq.  (19)  is  sufficient  for  ZLC.  To  prove  necessity  we  employ  a novel 
technique  which  succeeds  in  isolating  each  individual  mxm  minor  of  C(z). 

Let  the  pair  A(z),  B(z)  be  ZLC  and  let  A.  . (z)  denote  the  mxm  minor  of 

1 12'  "‘m 

C(z)  formed  with  the  given  m rows  and  the  m distinct  columns  numbered  i j , i2>  • • • , 
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q-{-| 

From  the  definition  of  ZLC,  these  n Cm  polynomials  are  devoid  of  any  common  zeros 

and  invoking  a classical  result  due  to  Hilbert,  there  exist  polynomials  a.  . (z) 

, , 1ll2‘ ’ "lm 

such  that 


1 = £ a . (z) A.  (z)  . 

...  1 2 m 12  m 

(i) 

In  addition,  the  a's  can  all  be  chosen  real  if  all  the  A's  are  real. 


(21) 


Pick  K to  be  any  (q+i  )xm  real  constant  matrix  whose  mxm  minors  ^ ^ ) 

Of 

are  all  nonzero,  introduce  q+i  extra  independent  variables,  X^,  X^,  • • • > X^+^  and 


A(X)  = diag  [Xt , X2>  . . . , Xq+^  ] 


(22) 


The  polynomial  matrix 

D(z,  X)  = C(z)  A(X)K 

is  mxm  and  from  the  Cauchy -Binet  theorem. 


(23) 


A(z,X)  HdetD(z.X)  = V X X . . . A (z)k()’  ?’  ’ 

(i  l 

Thus , for  every  one  of  the  C^  m -tuples  (i)  = (i  ^ , i ^ , . . . , i ) , 

(ll,l2 lm\  _ 5mA(z,  X) 


(24) 


, x„fVlZ m\  o A(z,  X) 

(Z>K1  1,  2 m/~  ?X.  5X.  . . . 9X. 

12m  l,  i.  x 


1 2 


(25) 


(X)=0 


Let  D (z,X)  denote  the  mxm  polynomial  matrix  adjugate  to  D(z,X).  Since 


A(z,X)l  = D(z,X)D  (z,X)  , 

III  d 


(26) 


multiplication  of  both  sides  of  Eq.  (23)  on  the  right  with  Da(z,X)  yields 


A(z,X)l  = C(z)A(X)KD  (z,X)  . 

1 1 ) d 


(27) 


In  view  of  Eq.  (25),  Eq.  (27)  permits  the  identifications 


A.  . . (*)1  = C(z)Z.  . . (z) 

1,1.. ..i  m • ’ — ’ 

12  m 


l . i,.  . . l 

12  m 


(28) 


Such  K's  always  exist  for  q+f  >m.  A diagonal  kxk  matrix  with  diagonal  elements 
a ^ , a^,  . . . , a^  is  written  diag  [a  j , a2>  . . . , a^]. 
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where  for  all  (i). 


Z.  . . ( 

l.i,  ...  1 

12  m 


5,u(MX)KDa(z,M) 

ax.,  ax.  ...  ax. 


• • • i i v d\.  d\.  • • • d\. 

\ ll  l2 

1 , 2,  . . • , m / 


1 (X)  = 0 


is  (q+i  )xm  and  polynomial.  Finally,  by  combining  Eqs.  (21)  and  (28)  we  reach  the 
desired  result  Equation  (19), 

C(z)Z(z)  = A(z)X(z)  + B(z)Y(z)  = 1 (3 

m ' 

where 


Z(z)  = V a . 


1 ll2 
(i)  1 L 


■ (z)Z.  . . ( 

i l.i,  ... i ' 

m 12  m 


, X(z) 

- • 


An  examination  of  the  above  procedure  reveals  that  Z(z)  is  always  real  if  C(z)  is  real. 

& 

2)  Suppose  that  the  pair  A(z),  B(z)  satisfies  Eq.  (20)  for  every  i = 1 ->n.  Then, 

by  Cauchy -Binet,  the  g.c.d.  of  the  mxm  minors  of  C = [A  | B ] must  divide  every  4>.(z). 

Since  iL(z)  is  nontrivial  and  independent  of  z.,  i = l-»n,  this  g.c.d.  must  be  a nonzero 
1 q+f  1 

constant  and  the  C^  A's  are  therefore  relatively  prime.  Hence,  A(z)  and  B(z)  are 
MLC,  The  necessity  of  Eq.  (20)  is  also  easily  established  with  the  aid  of  Equation  (28). 

By  definition,  A(z)  and  B(z)  are  MLC  if  the  A's  form  a relatively  prime  set  of 

3 

polynomials.  But  then,  according  to  another  classical  result,  for  every  i = 1 -*  n 

there  exist  polynomials  a.  . (z;i)  such  that 

12  m 

Zai,i-...i  (z;i)Ai,i,...i  (z)  = ^(z)  (32) 

jlj  1 2 m 12  m 

where  4^(2)  is  nontrivial  and  independent  of  z,. 

As  before,  the  a's  can  be  chosen  real  if  all  the  A's  are  real.  Thus,  combining 
Eqs.  (28)  and  (32), 

C(z)Z.(z)  = A(z)X.(z)  + B(z)Y.(z)  = 4/.{z)lm  (33) 


where 


„ fx. 

Z.(z)  = 'a.  . (z;i)Z  . (z)  = __1 

1'  Lt  l.x,...i  ' 1,1,  ••  .1  ' ' Y 

...  1 z m 1 2 m [_  1 


Xt(z) 

yytt 


A more  constructive  approach  is  given  in  the  proof  of  the  MLP  lemma  preceding 
Theorem  4. 
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The  proof  of  Theorem  2 is  complete,  Q.  E.  D. 

Corollary . Let  A ^ ^ (z) , A^  2(z)  and  A^j(z)  be  three  polynomial  matrices  of  com- 
patible sizes  such  that  A2 } (z)A  ~ j (z)  A } 2(z)  is  polynomial.  Then,  if  the  pair  An,A12 
is  MLC,  A21(z)At|(z)  is  polynomial  and  if  the  pair  A^,  A21  is  MRC,  a'|(z)A12(z)  is 
polynomial. 

Proof.  Assume  that  the  pair  Aj  , A 2 is  MLC.  Then,  according  to  Theorem  2, 
part  2),  for  every  i=  l-»n,  there  exist  polynomial  matrices  X^{z),Y.(z)  such  that 

AII(z)xi(z)  + A12(z)Y.(z)  = 4-.(z)l  (35) 

where  4c(z)  is  nontrivial  and  independent  of  z..  Thus, 

A21(z)X.(z)  + A21(z)A‘J(z)Al2(z)Yi(z)  = A21(8)Aj[(*)ii».(z)l  , (36) 

and  from  the  premise  it  follows  that  the  product  A21(z)A  j j(z)4c  (z)  is  polynomial  for 
every  i=  l-»n.  In  other  words,  every  denominator  of  every  nontrivial  entry  in 
A2j(z)Ajj(z)  divides  every  + (z)  and  is  therefore  independent  of  every  z.,i=  1 -»n. 

Ihis  is  only  possible  if  all  such  denominators  are  nonzero  constants  and  A2^(z)A”j(z) 
is  polynomi  1,  Q.E.D.  (A  dual  argument  handles  the  case  Aj  j,  A^  MRC.) 

Let  us  return  to  the  problem  of  factorizing  an  mxq  polynomial  matrix  A(z)  whose 
normal  rank  r<  min(m,q).  By  an  appropriate  interchange  of  rows  and  columns  we  can 
always  assume  that 


A(z)  = 


where  det  A^(z)  ^ 0.  Hence,  by  exploiting  a well  known  matrix  result, 

AZZ^  ~ A21(z)A"}(z)A12(z)  (38) 

from  which  it  is  immediately  verified  that 

A(z)  = Aj(z)A2(z)  = A3(z)A4(z)  (39) 

where 

if 

The  method  employed  in  the  proof  of  Theorem  2 can  also  be  used  in  the  1 -variable 
case  and  avoids  any  appeal  to  the  Smith  canonic  form. 


r 

q-r 

An(2) 

Ai2(z) 

AZl(z) 

a22(z) 

u(z)  i o. 

Hence, 
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1 A,  , 

A (z)  = ^ ; A (z)  = *1 

A2iAi!  A2> 


aj(z)  = [A,  j Iai2^  ' a4(z)  = tlr  I 1 a12^  • (41) 

According  to  the  corollary  to  Theorem  2,  if  the  pair  is  MLC  or  if  the 

pair  A.  jjA^j  is  MRC,  Eq.  (38)  is  sufficient  to  assure  that  at  least  one  of  the  two 
breakdowns  in  Eq.  (39)  is  completely  polynomial.  For  n = 1 and  2 this  minor- 
coprimeness  can  always  be  accomplished  by  either  extracting  a square  polynomial 
factor  from  the  left  of  A^fz)  or  from  the  right  of  A^(z)  and  using  the  polynomial 
quotients,  but  for  n>3  the  inequivalence  of  MC  and  FC  precludes  any  such  general 
possibility. 

Theorem  3.  For  n = 2,  a polynomial  pair  A(z),  B(z)  is  MLC  iff  it  is  FLC. 

Proof  (Neces sity) . Let  the  pair  A(z),B(z)  be  MLC  but  not  FLC.  Then, 

C(z)  = [A(z)|B(z)]  admits  a polynomial  decomposition  C(z)  = CjfzJC^z)  wherein 
C^(z)  is  square  and  non -elementary.  Since  MLC  implies  that  normal  rank  C(z)  =m, 
detC^(z)  is  a nonconstant  polynomial  which  divides  all  the  mxm  minors  of  C(z),  a 
contradiction. 

Sufficiency.  Let  A(z)  and  B(z)  be  FLC  and  assume  first  that  normal  rankC(z)  =m. 
Let  the  g.  c.d.  4H2)  of  the  mxm  minors  of  C(z)  possess  a linear  divisor  z^-T\,  T|  a 
constant.  Then,  normal  rank  C(ri,z7)<  m and  there  exists  an  mxl  polynomial  column- 
vector  x(z2)  in  the  single  variable  z^  such  that 


x'(z2)C(r\,zz)  ^ 0^+| 


Obviously,  there  is  no  loss  of  generality  in  assuming  that  the  m components  of  x(z^) 
are  relatively  prime. 

Thus,  x(z2)  can  *3e  incorporated  into  the  first  column  of  an  mxm  elementary 
polynomial  matrix  X(z2)  and  it  follows  from  Eq.  (42)  that  the  first  row  of 

D(z)  = X'(z2)C(z)  (43) 

vanishes  identically  in  z^  for  z j = T| . Consequently,  if 

5 / 

Column-vectors  are  written  a,x,  etc.,  and  A denotes  the  matrix  transpose  of  A 

* • 

A + B is  the  "direct"  sum  of  A and  B. 
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A(z1)  = («1-T|)l1+  . 

the  matrix  C2(z)  = A ^(z^)D(z)  is  polynomial  as  is  the  decomposition 
C(z)  = (X,(z2))_1A(z1)C2(z)  = Cj(z)C2(z) 


(44) 


(45) 


But 

detC^(z)  = constant  • (Zj-T))  , (46) 

which  is  incompatible  with  the  FLC  assumption.  Similarly,  i)/(z)  can  possess  no  linear 
divisors  of  the  form  £ a constant.  In  short,  any  irreducible  factor  of  vJj(z)  must 

be  a 2-variable  polynomial.  (It  is  also  important  to  observe  that  if  C(z)  is  square,  the 
factor  Cj(z)  in  Eq.  (45)  can  be  extracted  from  the  right  as  well  as  from  the  left.) 

Since  the  entries  in  C(z)  can  be  viewed  as  polynomials  in  z7  with  coefficients  that 

1 ^ 

are  polynomials  in  z^,  it  is  easily  shown  that  there  exists  a square  polynomial  matrix 
V(z)  whose  determinant  is  a nontrivial  polynomial  in  the  single  variable  z ^ , such  that 

C(z)V(z)=  . ,47) 

By  Cauchy-Binet,  ^(z)  divides  detE(z).  With  the  aid  of  the  factorization  technique 
described  above,  it  is  possible  to  effect  a polynomial  decomposition 

E(z)  = C^zJFfz)  (48) 

in  which  detCj(z)  is  devoid  of  any  linear  divisors  of  the  form  Zj-T|  and  detF(z)  is  a 
nontrivial  polynomial  in  the  single  variable  Zj.  Clearly  then,  because  i|j(z)  is  free  of 
all  1-variable  divisors,  it  must  divide  detCj(z). 

Thus, 

C(z)  = C1(z)[F(z)|Om  q+i  _m]V_1(z)  = C^zJC^z)  . (49) 

The  matrix 

C2(z>  = CF(lHOmj(J+(  ,50) 

is  polynomial.  In  fact,  C2(z)  = M(z)/g(Zj)  where  M(z)  is  polynomial  and  g(Zj)  is  a 
nontrivial  scalar  polynomial  in  the  single  variable  Zp  Since  C(z)  is  polynomial,  any 


This  key  observation  is  a special  case  of  lemma  2.  1,  Reference  1. 
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linear  factor  Zj-r|  of  g(z^)  must  divide  the  product  C^(z)M(z);  i.e. 

C.(r),  z_)M(ti,  zD  =0 
1'  1 2'  ' " 2'  m , q+jf 


But,  being  free  of  the  divisor  z j - T) , det  Cj(r|,  z^)  r 0 whence, 

“I'W  <52> 

and  the  polynomial  Zj-T)  divides  MJz^jZp).  Since  all  such  linear  factors  can  be  cancelled 
by  a repetition  of  this  cycle,  g(Zj)  itself  must  divide  M(z)  and  C^(z)  is  therefore  poly- 
nomial, as  stated. 

By  invoking  the  FLC  property  it  is  now  concluded  that  Cj(z)  is  elementary,  ij'(z) 
is  a nonzero  constant  and  the  pair  A(z),B(z)  is  MLC.  Finally,  to  complete  the  proof 
of  Theorem  3 it  must  be  shown  that  the  FLC  hypothesis  excludes  the  possibility  that 
normal  rankC(z)  < m. 

Let  normal  rank  C(z)  = r<  m.  Then,  for  some  choice  of  square  constant  permuta- 
tion matrices  P and  Q, 

r q+f  -r 


PC(z)Q  = 


A^jTz) 


q+f  -r 

A1  2(z) 

r 

A9  2^^ 

m - 

r 

A22(z)  = 

A21 

(z)A 

that  the 

pair 

A11 

where  det  A ^ ^ (z)  ^ 0 and  A^^(z)  - A2i(z)A|j(z)A12(z).  It  should  be  clear  that  A(z)  ,B(z) 
FLC  implies,  a fortiori,  that  the  pair  A ^ ^(z) , A 2(z)  is  also  FLC.  But  then,  from  what 
has  already  been  proved,  since  the  rx(q+f  ) matrix 

A2(z)  = [An(z)|A12(z)]  (54) 

has  normal  rank  r,  A^  j(z)  and  A^2(z)  must  be  MLC  and  according  to  the  corollary  to 
Theorem  2,  A2^(z)A^j(z)  is  polynomial.  Thus,  the  factorization 


C(z)  = P' 


. A - 1 m , m -r 
A2 1 A 1 1 


m -r , q+f 


Q = C j(z)  • Cp(z) 


is  polynomial  but  Cj(z)  is  square  and  non-elementary,  a contradiction,  Q.E.D. 

(There  is  no  doubt  that  the  equivalence  of  MLC  and  FLC  for  n = 2 goes  a long  way 
towards  explaining  why  certain  1 -variable  synthesis  ideas  admit  reasonably  successful 
2-dimensional  generalizations.) 


r 
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O^.  Imbedded  in  the  sufficiency  part  of  the  proof  of  Theorem  3 is  a simple 
algorithm  for  the  removal  of  any  1-variable  divisor  of  the  g.c.d.  by  the 

extraction  of  appropriate  matrix-polynomial  factors.  It  is  distinguished  mainly  by 
its  explicit  use  of  the  known  result  that  any  polynomial  row-vector  in  one  variable 
whose  components  are  relatively  prime  can  be  incorporated  into  the  first  row  of  an 
elementary  polynomial  matrix.  This  strongly  suggests  that  an  understanding  of  the 
structure  of  n-dimensional  elementary  polynomial  matrices  can  provide  insight  and 
guidance  in  the  formulation  and  solution  of  the  general  synthesis  problem.  Theorem  4 
gives  a complete  answer  for  n=  2 and  a partial  answer  for  n>  3 with  the  aid  of  the 
zero  left-prime  concept.  It  also  makes  essential  use  of  an  auxiliary  lemma  of  con- 
siderable interest  in  its  own  right. 

$ 

The  MLP  lemma.  An  mxr  polynomial  matrix  C(z),  m < r,  is  MLP  iff  for  every 

i=  l-»n,  it  can  be  incorporated  into  the  first  m rows  of  a polynomial  rxr  matrix  Cg(z) 

whose  determinant  ^ Is  nontrivial  and  independent  of  z.  . Moreover,  C (z)  can  be 
l i e 

constructed  real  if  C(z)  is  real. 

Proof.  (Sufficiency) . Suppose  that  such  an  enlargement  is  possible  for  every 
i=  1 -»n.  Then,  from  the  adjugate  identity 

Ce(z)(Ce(z))a  = +.(z)lr  (56) 

we  obtain,  for  every  i=  l->n,  an  rxm  polynomial  matrix  Z^(z)  which  satisfies 

C(z)Z.(z)  = i|i.(z)lm  (57) 

and  the  MLP  character  of  C(z)  follows  by  exactly  the  same  reasoning  employed  in 
part  2)  of  Theorem  2. 

Neces sity . For  every  fixed  i,  1 < i < n,  the  entries  in  C(z)  can  be  viewed  as  poly- 
nomials in  z.  with  coefficients  that  are  polynomials  in  the  remaining  n-1  variables. 
Hence,  by  adapting  the  standard  1 -variable  argument  used  to  derive  the  Smith  form, 
it  can  be  asserted  that  there  exists  an  rxr  polynomial  matrix  V(z)  whose  determinant 
d(z)  is  nontrivial  and  independent  of  z^  such  that, 

C(z)V(z)  = [E(z)|Omr_m]  . (58) 

Let  Cj(z)  denote  the  matrix  formed  with  the  last  r-m  rows  of  the  adjugate  of  V(z). 

£ 

ZLP,  MLP  and  FLP  abbreviate  zero  left-prime,  minor  left-prime  and  factor 
left-prime,  respectively.  Their  meanings  are  of  course  identical  with  those  given 
in  the  previous  definitions  for  ZLC,  MLC  and  FLC  of  C(z)  = [A(z)|B(z)]. 


1 


V(z)  and  E(z)  can  be  constructed  real  if  C(z)  is  real. 
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1 


(59) 


(60) 


(61) 


(62) 


(63) 


and  by  Cauchy -Binet,  det  E(z)  divides  the  g.c.d.  of  the  mxm  minors  of  the  left-hand 
side  of  Equation  (63),  By  hypothesis  C(z)  is  MLP  and  this  g.c.d.  equals  dm(z).  Con- 
sequently, detE(z)  and  therefore  detCe(z),  are  both  independent  of  z.,  Q,  E.  D. 

Theorem  4.  Let  C(z)  denote  an  mxr  polynomial  matrix,  m < r.  Then,  1)  in  order 
that  C(z)  be  incorporable  into  the  first  m rows  of  an  rxr  elementary  polynomial  matrix 
Ue(z)  it  is  necessary  that  it  be  ZLP. 

2)  For  n=  2 the  ZLP  condition  is  also  sufficient. 

3)  For  n >3  the  ZLP  condition  is  sufficient  if  it  can  be  proved  to  suffice  for 

m = 1 . 

Proof.  1)  Clearly,  any  common  zero  of  all  the  mxm  minors  of  C(z)  must  appear 
as  a zero  of  the  determinant  of  any  rxr  polynomial  matrix  which  includes  C(z)  in  m of 
its  rows.  Thus,  no  such  matrix  can  be  elementary  and  the  ZLP  property  is  therefore 
necessary. 

2)  Let  C(z)  be  ZLP  and  suppose  that  n=  2.  Then,  from  the  ZLP  property  alone, 
it  follows  from  Theorem  2,  Part  1),  that  there  exists  a 2-variable  rxm  polynomial 
matrix  Z(z)  such  that 
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C(z)Z(z)  = 1 


Evidently  Eq.  (64)  implies  that  C(z)  and  Z(z)  are  MLP  and  MRP,  respectively. 

Thus,  invoicing  the  MLP  lemma  (and  its  dual),  there  exist  polynomial  matrices 
Cj(z)  and  Zj(z)  of  sizes  (r-m)xm  and  rx(r-m)  such  that  the  determinants  of  the  two 


rxr  matrices 


Ce(z)  = ’ Ze(z)  = Uz(z)  | Z i(Z)3 

are  independent  of  z^  and  z^,  respectively;  i.e., 
detCe(z)  = f(z2)  , detZe(z)  = gtZj) 

By  direct  multiplication, 


Ce<z>Zetz) 


1 I CZ. 

m 1 


or,  after  performing  some  obvious  row  and  column  operations  which  wipe  out  CjZ  and 
CZj  (and  leave  determinants  unchanged), 


C "1.  [Z  | 1 -ZC)  Z 1 . 

:j(lr-ZC)  r 1 ■ + Cl(1r'ZC,Zl  • 


Let  L(z)  = C^(lr  -ZC)Zj.  It  follows  from  (68)  that  detL(z)  = f(z2)g(Zj),  a product 
of  two  1 -variable  polynomials!  It  is  therefore  possible  to  employ  the  factorization 
algorithm  for  the  extraction  of  1 -variable  determinantal  factors  (described  in  the  proof 
of  Theorem  3)  to  effect  a decomposition  L(z)  = L^(z)L^(z)  where  L2(z)  and  Lj(z)  are 
both  (r-m)x(r-m),  det  L2(z)  = f(z2)  an^  detL^(z)  = g(Zj).  By  the  same  argument  used 
to  establish  the  polynomiality  of  C,(z),  Eq.  (50),  Theorem  3,  it  follows  that  Z_  = 

_ i _ i c £■ 

(1  -ZC)Z  L and  C2  = L C ( 1 - ZC)  are  both  polynomial.  But  then,  Eq.  (68)  gives 
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3)  It  is  the  authors'  belief  that  ZLP  is  no  longer  sufficient  for  n>3  but  a counter- 
example is  not  at  hand.  Nevertheless,  as  a positive  contribution  we  have  developed  a 
simple  recursive  method  which  proceeds  by  induction  and  shows  that  all  the  difficulty 
resides  in  the  case  m=  1. 

Suppose  therefore  that  ZLP  is  sufficient  for  all  m < s.  We  prove  that  it  also 
suffices  for  m=s+l.  Let  the  (s+l)xr  polynomial  matrix  C(z),  s+l<r,  possess  the  ZLP 
property.  Then,  a fortiori,  the  sxr  polynomial  matrix  C^z)  formed  with  the  first  s 
rows  of  C(z)  is  also  ZLP  and  by  the  induction  hypothesis  there  exists  an  rxr  elementary 
polynomial  matrix  F(z)  which  incorporates  C^(z)  into  its  first  s rows. 

Let  c (z)  denote  the  last  row  of  C(z).  Clearly,  the  solution  x7(z)  of  the  equation 

x'(z)F(z)  = c'(z)  (71) 

is  polynomial.  Write  x'  = [x'  |x^  ] where  x'  is  lxs  and  x'  is  lx(r-s).  Then,  because 
of  the  mode  of  construction  of  F(z)  and  the  choice  of  x/(z), 


x'(z) 


s , r-s 


x'(z) 


F(z)  = C(z) 


(72) 


It  is  not  difficult  to  see  from  Eq.  (72)  that  any  common  zero  of  the  r-s  polynomial 
components  of  x^(z)  is  necessarily  a zero  of  all  the  (s+l)x(s+l)  minors  of  C(z).  Hence, 
x^(z)  must  be  ZLP. 

By  the  induction  hypothesis,  x^(z)  can  be  incorporated  into  the  first  row  of  an 
(r-s)x(r-s)  elementary  polynomial  matrix  X(z)  and  it  is  easily  seen  that 


Ue(z) 


1 

0 

s 

s,  r-s 

x'(z) 

X(z) 

Y(z) 

F(z) 


(73) 


is  an  rxr  polynomial  matrix  which  incorporates  C(z)  into  its  first  s+1  rows,  irrespective 
of  the  choice  of  (r-s-l)xs  polynomial  matrix  Y(z).  But 


detUe(z)  = detX(z)  . detF(z)  = nonzero  constant 


(74) 


so  that  Ug(z)  is  elementary  and  the  proof  of  Theorem  4 is  complete,  Q.E.  D. 

Final  comment.  Theorems  l to  4 have  the  correct  system  emphasis  and  should 
prove  useful,  but  unfortunately,  the  authors  have  as  yet  been  unable  to  translate  the 
FC  property  into  a tractable  criterion.  This  is  a serious  hiatus,  because  in  our 
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opinion  it  is  FC  which  lies  at  the  heart  of  the  n -dimensional  synthesis  problem.  We 
hope  to  address  this  and  other  related  open  questions  in  a future  paper. 

Rome  Air  Development  Center 

F-30602 -78 -C -0048  D.  C.  Youla  and  G.  Gnavi 
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THE  IDENTIFICATION  OF  LINEAR  DYNAMICAL  SYSTEMS  FROM  TIME-DOMAIN 
MEASUREMENTS:  A CRITICAL  STUDY  OF  CERTAIN  ASPECTS  OF  PRONY'S  METHOD 

D.  C.  Youla 

The  problem  of  identifying  the  impulse  response  of  a lumped,  linear  time- 
invariant  system  from  a discrete  set  of  its  samples,  is  essentially  finite -dimensional 
in  character. 

Prony's  method  is  distinguished  by  the  fact  that  it  recognizes  this  finite- 
dimensionality  from  the  outset  and  thereby  succeeds  in  avoiding  much  of  the  numerical 
instability  exhibited  by  algorithms  that  are  designed  to  work  in  an  infinite -dimensional 
setting. 

This  report  attempts  to  generate  some  new  insight  into  both  the  theoretical  and 
practical  implications  of  Prony's  ideas  and  it  is  believed  that  Theorems  1-4  make  a 
modest  contribution  in  this  direction. 

A.  Formulation 


H(s)  = 


b„  + b,s+...+b  ,s 
0 1 n-  1 

n 

a.  + a. s + • • • + a s 
0 1 n 


an  / 0 • 


denote  the  transfer  fun  timi  .1  u dynamical  causal  linear  time -invariant  analogue 

system.  Let  ( ( denote  the  distinct  poles  of  H(s)  and  m^,  m^,  . . . , 

their  ri  -pi  ti-.e  multiph  .1  • . linn,  the  associated  impulse  response  h(t) » H(s) 

and  i s of  the  to  mi 


h(t)  = V P (t)  e 1 . t > 0 


where  P (t)  is  a polynomial  of  degree  rn^-1,  i=  1 "*<1,  and 

m , + m-,  + • • ■ + m = n . (3) 

1 2 q 

Of  course,  the  s 's  occur  in  complex-conjugate  pairs  and  the  defining  coefficients  in 
H(s)  and  in  all  P's  are  real.  It  is  easily  shown  from  Eq.  (2)  that 


By  definition,  the  z-transform  of  the  1 -sided  sequence 
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ck=h(kT),  k = 0->oc 

obtained  by  sampling  h(t)  once  every  T seconds  is  given  by 


W(z)  = V c^z'k  = V,  h(kT)i 

K ‘ — ' _ 


<*  , q s.kT 

= T z'k  y P.(kT)e  1 
Li  L-1  l'  ' 

k=0  i=1 


q * S.kT  . 

= y V P.(kT)e  1 z‘k 

U —l  1 


i=l  k=0 


q <* 

V V | 
i=  1 k=0 


d . skT 
n — )e 


* S (",<£)  l «skT*'k) 

i=  1 k=  0 s=si 


y (p.(J-) — Uf — rY 

u \ i'ds  , sT  - 1 / 
i=l  1-e  Z s=s 


In  particular,  if  the  poles  of  the  analogue  system  are  all  simple,  each  P;(t)  reduces  to 
a real  constant  A.,  i=  l->n,  and  Eq.  (11)  collapses  into  the  familiar  formula, 


W(z)  = 


n A. 

= y 1 

Lj  s . T . 

i=  1 1 - e 1 z 


Observe  that 


(1)  W(z)  is  analytic  in  a neighborhood  of  z = (Causality), 

(2)  vanishes  at  z - 0 (h(t)  is  free  of  impulses)  and 

(3)  possesses  the  distinct  poles 


z.  = e 
i 


with  respective  multiplicities  rm,  i=  l-*q. 


3 Unde r the  transformation  z=eh  , two  s *s  with  the  same  real  parts  but  with 
imaginary  parts  that  differ  by  an  integer  multiple  of  2tt/T  map  into  the  same  z^.  We 
assume  that  the  sampling  rate  l/T  is  high  enough  to  preclude  any  such  "folding"  effect. 
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In  short,  the  task  of  identifying  h(t)  from  a knowledge  of  St  + 1 of  its  samples 
ck  = h<kT)  . k=  0 -*t  , (14) 

has  been  reduced  to  the  following  purely  algebraic  problem:  1)  Given  t + 1 real  numbers 
cq>Cj,  ‘ ’ c g a real  rational  function  W(z)  which  is  expressible  in  the  irreducible 

form 


n - 1 , 


W(z)  = 


z(f0+f1z+...  +fn  lz  " ) 


1 + g!z  + g2z 


= zf(z) 

2 n g(z)  ' 

+ . . . + g z B\  I 


(15) 


n > i • gn  i o 


and  whose  Laurent  expansion 


C1  . C2 


*w  ='o  + T-*-4t"-  + T-  * S V 

Z z k=f+l 


-k 


(16) 


about  z=  cc  possesses  the  prescribed  coefficients  c^,  k=  0 -*!.  (By  irreducible  we  mean 
that  f(z)  and  g(z)  are  relatively  prime.) 


2)  Having  determined  W(z)  in  step  1),  perform  the  partial  fraction  expansion 

(16a) 


q m. 

= V V B<1)(z  -z  )-k 
z Lj  U k i 


i=  1 k=  1 

and  use  the  B's  to  calculate  the  A's  in  the  partial  fraction  expansion, 


H(s) 


= V V A(i) 


i=  1 k=l 


A'  '(s  - s.) 
k l' 


-k 


(16b) 


If  all  poles  are  simple,  every  rm  is  equal  to  unity  and  we  can  immediately  make  the 
complete  identifications 

Ajl}  = B[l)  , i = 1 -» n . (16c) 

However,  if  some  of  the  poles  are  multiple,  the  relationship  between  the  B's  and  A's  is 
rather  subtle  and  it  becomes  necessary  to  exploit  the  interconnection  formulas  (4)  and 
(11).  (Another  approach  is  described  in  Theorem  4.) 

Step  1)  is  the  subject  of  Theorems  1 and  2 and  the  answer  to  2)  is  contained  in 
Theorem  3. 
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B.  The  Main  Results 


From  Eqs.  (15)  and  (16), 


z(f0+  fjZ+  . . . +fn  lzn  *)  = (1  + gjZ  + . . . +gnz“)  • 2_,  ckz 


n.  v -k 
) • ; c.  z 

Li 

k=0 


and  equating  coefficients  of  like  powers  of  z on  both  sides  we  get 
fn-l  = gnC0 
fn-2  = gn  - 1 C0+  8nCl 


f0  81  C0+  g2Cl+  •'  ‘ +8nCn-l 


0 


cr+  glCr+l  + g2Cr+2+--,+8nC 


r+n  ’ 


r > 0 


(17) 


(18) 

(18) 

(19) 


The  n + 1 equations  in  Eq.  (19)  obtained  by  letting  r range  from  0 to  n can  be  written  in 
the  matrix  form 

“l 


’co 

ci 

. . . c 

n 

C1 

C2 

• • • Cn+1 

Cn-1 

c 

n 

’ - ' C 2n - 1 

c 

n 

Cn+1 

• ' * C2n 

'n-1 


Or,  in  an  obvious  notation, 

c c = o , , 

n - -n+1 


= -n+ 1 


(20) 


(21) 


where  C is  the  coefficient  matrix  and 
n 

- = [s]  ’ - = (gl’  8 2 8n^ 


A solution 

C = (g0.  gj gn)' 


(22) 


(23) 


470 


SYSTEMS,  CONTROL  AND  NETWORKS 


of  Eq.  (21)  is  said  to  be  admissible  if 
1)  g0gn  f 0 and 


(24) 


2)  if  the  polynomial  f(z)  generated  by  Q by  means  of  the  recursions  contained  in 
Eq.  (18)  is  relatively  prime  to 


g(z)  = gQ+  gjZ+  . . . +g 


(25) 


Thus,  the  rational  function  W(z)  = zf(z)/g(z)  defined  by  an  admissible  £ is  of  degree 
precisely  equal  to  n.  Theorem  1 gives  an  exact  and  useful  characterization  of  admissibility. 
Theorem  1 . Let 


"n-1 


"co 

C1 

. . . c , 
n - 1 

'ci 

c2 

• * • cn 

C1 

C2 

. . . c 

n 

X 

3 

II 

c2 

c3 

• • • • 

* • * Cn+ 1 

Cn-1 

c 

n 

• • • C2n  2 

c 

n 

Cn+1 

1 

c 

fM 

U 

(26) 


Then,  the  homogeneous  linear  system  Eq.  (21)  possesses  an  admissible  solution  £ iff 

(27) 


rank  C = rank  C , = rankH  = n 
n n- 1 n 


Such  a solution  is  unique  up  to  multiplication  by  a nonzero  scalar. 

Proof.  Sufficiency.  Suppose  that  Eq.  (27)  is  true.  Then,  is  singular  and 
Eq.  (21)  possesses  one  linearly  independent  solution,  C = (gQ>  gj  > • • • « gn)»  We 
maintain  that  g^g^  j-  0.  Indeed,  g^  = 0 implies  that 


"n-1 


= 0 
-n 


(28) 


lgn-lj 

while  gQ  = 0 implies  that 

V 


H 


= 0 

• n 


(29) 
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and  in  both  cases  the  nonsingularity  of  ^ and  Hn,  respectively,  forces  Q to  be  the 
zero  vector,  a contradiction.  Hence,  without  loss  of  generality  we  may  set  g^=  1 and 
g(z)=  l + gjZ+  ...  +gnzn,  gn  i °. 

By  construction,  g(z)  and  the  polynomial  f(z)  generated  by  £ with  the  aid  of  Eq. 
(17),  define  a rational  function  W(z)  = zf(z)/g(z)  whose  Laurent  expansion  about  z=  oo 
possesses  c^,  c^,  ...,  c^  as  its  first  2n+l  coefficients.  This  implies  that  f(z)  is 
relatively  prime  to  g(z).  Suppose  the  contrary.  Then,  degree  W(z)  < n-1  and  from  a 
well  known  result,  rank  Hn<  n-1,  a contradiction.^  Thus,  W(z)  has  all  the  desired 
properties. 

Necessity.  Let  Eq.  (21)  possess  an  admissible  solution  £ = (1 , g j , •••»  gn/  and 
let  f(z)  denote  the  associated  polynomial  synthesized  by  the  recursive  scheme  Equation 
(17).  From  the  definition  of  admissibility,  W(z)  = zf(z)/g(z)  has  degree  n.  Moreover, 
since  it  is  again  obvious  that  the  Laurent  expansion  of  W(z)  about  z = oc  duplicates  the 
coefficients  Cq,  c^,  ...,  c^  correctly,  rank  Hn=n. 

Now  clearly,  the  function  W(z)  = f(z)/g(z)  is  also  of  degree  n and  possesses  the 
Laurent  expansion 


W(z) 

z 


-z 

k=0 


ckZ 


-(k+1) 


(30) 


Thus,  appealing  once  again  to  the  result  quoted  in  the  footnote  below,  it  is  concluded 

that  rank  C = rank  C ,=n=rankH  , Q.E.D. 
n n-i  n 

Corollary.  Under  the  conditions  of  Theorem  1, 


(_Dn-iWiz)  = 


o 

u 

C1 

• • • Cn 

co 

C1 

• • • 0 

n 

C1 

• • • 

c2 

* ’ * Cn+1 

X 

C1 

• • • 

C2 

• • 

• • • Cn+1 

Cn-1 

c 

n 

' ' * C2n-1 

Cn-1 

c 

n 

* * * c2n-l 

0 

*o(z) 

• * * *n-l(z) 

1 

z 

n 

• • • z 

(31) 


^Consider  any  rational  function  W(z)  of  degree  < n whose  Laurent  expansion  at 
z = oo  is  given  by 
oo 

-k 


W(z)  = Yi  ckz 


k=0 


Then,  c degree  W(z)  = rankHn+r , r = 0 -+  oo. 
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where 


Vz)=ck+  ck.lZ+--*+C0Z  ’ k = 0 -»n  - 1 . (32) 

Thus,  f(z)  and  g(z)  are  constant  multiples  of  the  numeratpr  and  denominator  determi- 
(c ) 

nants'  ' in  Eq.  (31),  respectively,  and  all  matrix  inversions  are  obviated. 

Proof.  From  Eq.  (21) 


g(z)  = gn(*n  - • 


Hence,  invoking  a well  known  result  from  the  theory  of  determinants,  we  find  that 
under  the  normalization  g^  = 1, 

. det  C . 

*n=<-l>  ‘He? TT^  <34) 

n 

and  g(z)  equals  the  denominator  determinant  in  Eq.  (31)  multiplied  by  (-l)n  V^et  H • 
From  Eq.  (18) 


c0  C1 


and  from  Eq.  (21),  with  g_=  1, 


A = det  A = determinant  of  matrix  A. 


V 
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Thus,  by  elimination, 


f(z)  = 


which  is  easily  recognized  as  the  numerator  determinant  in  Eq.  (31)  multiplied  by 
l/detH^.  Consequently,  ( — 1 )n  *W(z)/z  is  given  by  Eq.  (31),  Q.E.D. 

In  many  typical  problems  of  identification  the  known  integer  n is  such  that  a 
priori,  degree  H(s)  < n.  This  case  is  considerably  more  involved  in  its  numerical 
aspects  than  the  case  degree  H(s)  = n which  falls  within  the  scope  of  Theorem  1. 
Nevertheless,  it  is  completely  encompassed  by  Theorem  2. 

Theorem  2.  The  degree  of  W(z)  is  less  than  or  equal  to  the  prescribed  integer 
n(>  1),  iff  the  real  symmetric  Hankel  array 


1 

n 

o 

ci 

' * * Cn  ’ 

ci 

c2 

* • * Cn+ 1 

• • 

. . . . 

(38) 

c 

n 

cn+ 1 

1 

c 

Csl 

u 

possesses  the  following  properties: 
1)  For  some  p,  1 < p < n, 


1 

n 

o 

n 

• * * Cp-» 

’C1  c2  * • ‘ CP 

C1  c2 

• * * cp 

C2  C3  * * ‘ Cp+1 

rank 

Cp-1  cp 

• ' * C2p-2 

= rank 

Cp  cptl  * ’ * c2p*  1 

rank  C^^  = rank  Hp  = p . (40) 

2)  Every  principal  submatrix  obtained  by  bordering  C . with  one  row  and  the 

P“* 

same  one  column  and  two  rows  and  the  same  two  columns  is  singular.' 


'Observe  that  Cp.j  is  the  principal  pxp  submatrix  of  Cn  located  in  the  upper 
left-hand  corner. 
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Proof.,  The  necessity  of  1)  and  2)  is  a consequence  of  Theorem  1 and  footnote  (b). 


As  for  their  sufficiency,  we  first  invoke  a result  concerning  the  rank  of  a symmetric 

4 

matrix  to  conclude  that  rank  C = p. 


According  to  Theorem  1,  the  rank  equalities  in  Eq.  (39)  guarantee  the  existence 
of  a solution  Q = (1,  gj,  • • • , g )' , g / 0,  of  the  homogeneous  system 


V-  = Vi 


(41) 


By  substituting  this  £ into  Eq.  (18)  with  n=  p,  we  generate  a rational  function 


W(z)  = 


z(fg+  f jZ  + . . . + f^  , ZF  ) 


EzL 


p-1, 


1 + g,z+  . . . +g  zl 


(42) 


’1  °p~ 

of  degree  p v'hose  Laurent  expansion  automatically  duplicates  the  given  samples 


, , c2p-  I*  must  now  be  shown  that  c2pH>  c2p+2’  '*'•  c2n  are  a*so  rePro_ 
duced  correctly  if  p is  less  than  n. 


c0*  cr 


For  this  purpose  let  us  write  in  the  partitioned  form 


(43) 


”2p 


'2p+ 1 


C2pf  1 c2pt2 


. . . c 


2p-l 


pf  1 


• • • c 


2p 


.-1 


P-1 


p+1 


"2p-l  c2p 


. (44) 


TiT 


Let  the  rxr  matrix  A be  nonsingular.  Then 

rankLr|§]  = r 


-1 


iff  D = CA  B (see  Reference  3). 
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This  means  that  c^^  and  c2p^2  are  uni<luely  determined  by  c^,  c ^ , . . . , c^pfrom  the 

requirement  rank  C^=  p.  However,  the  Hankel  matrix  generated  by  the  Laurent 

expansion  of  the  rational  function  Eq.  (42)  has  C in  its  upper  left-hand  corner  and  also 
(b)  P „ 

satisfies  rank  Cn=p.  Thus,  ^2p+l  = c2pfl  anc*  ^2p+2=c2pf2'  P-ePeating  this  process 
on  Cpf2  it  is  seen  that  ^3=  c^  and  = c^,  etc.  , until  c2n  = c^,  Q.E.D. 

By  making  contact  with  the  modern  concept  of  degree  in  terms  of  Hankel  matrices, 
Theorems  1 and  2 succeed  in  giving  Prony's  algorithm^  the  proper  system  orientation. 
The  advantages  of  such  a setting  are  of  course  obvious.  Our  next  theorem  contains  an 
effective  explicit  solution  for  the  A coefficients  in  the  partial  fraction  expansion  Eq. 

(16b)  as  functions  of  the  B's  in  Equation  (16a).  It  therefore  brings  to  a close  the  entire 
circle  of  ideas  originated  in  Theorems  1 and  2. 

Theorem  3.  For  any  two  nonnegative  integers  k and  f , let^ 
k 


\,,  = Z<-»rkc  r1' 

r=0 

;So,osl  ' 

(45) 

Then,  for  every  i=  1 ->q, 

A?*  - B«  , 

(46) 

A(l)  = B(l)/(z.T)  1 
m.  m/  1 

1 1 

(47) 

and 

B^ 

k 

Sk-l,k  Sk-l,k+l 

’ * ’ Sk-l,m.-l 
1 

k(k-D 

m.(m.-l) 

k+1 

R(i) 

Bk+2 

Sk,k  Sk,k+1 

• • • 1 

A(i)_  1 (-D 

(-1)  2 

n c 

c 

k (z.T)*'1  k|(k+l):. 

. . (nv -1)1 

k+1,  k+1 

• • • &kf 

*<*> 

m. 


m . -k 
1 


<-i) 


k = 2 -*  m.  - 1 
1 


. . S. 


m.  - 1 , m . - 1 
1 1 


(48) 


The  binomial  coefficient  kCr  denotes  the  number  of  combinations  of  k things  taken 
r at  a time  and  the  sums  Sj^i  are  discussed  in  Section  E. 
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r(i)  i \ 

li  - \~7  sT  / 


ds  z - e' 


for  any  nonnegative  integer  I . 


Let  y = e . Clearly, 


J.  = TesT  -2.  = Tv  — 
3s  Ae  dy  ly9y 


and  therefore,  operationally, 


a = , a vi  _ Ti  a 
^7  - - T 


Hence,  applying  the  well-known  Liebnitz-type  rule 


, a a _ v (-Dk  c k.  a .i 
(y  s7)  - L k,i y (-§9) 


to  the  function  l/(z-y)  we  obtain,  in  view  of  Eq.  (58), 


‘i’-S  T' t-i)k-1ksk 

k=0 


■(k  + 1) 


Now,  by  taking  Eq.  (56)  into  account  and  grouping  terms  in  Eq.  (57)  it  is  readily  verified 


™±-'t 


BSl=  t piT'(-,>ksl'sk,,=l-i»k*f  ' l 7T  sk.l  Aitl  • 


k = 0 -»m.  - 1 


Rewritten  in  matrix  form,  Eq.  (63)  goes  into  the  upper -triangular  linear  system 
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1 0 0 


0 Sl,l  S1 , 2 


= 0 0 s„ 


l,m. -1 


2,  m.  -1 

i 


T2A<3i> 


m.  -1 
T 1 A 


m.  - 

(-z;)  1 


. . . S. 


m.-l,  m. -1 


and  Eqs.  (46)  to  (48)  drop  out  immediately  by  Cramer's  rule,  Q.E.D. 

Comment.  The  numerical  problems  that  are  encountered  in  performing  partial 
fraction  expansions  are  by  no  means  trivial.  However,  the  literature  on  this  subject 
is  rather  abundant  and  we  will  not  pursue  the  topic  further. 

C.  Worked  Examples 

We  have  chosen  two  examples  to  illustrate  some  of  the  mechanics  associated 
with  the  procedures  developed  in  the  above  theorems. 

Exam  pie  1 . Show  how  to  find  the  minimum -degree  transfer  functions  H(s) 
which  generate  the  samples 

cQ=  1.  c1=  2,  c2=  0,  c3=0  (65) 

Solution.  Ordinarily,  with  either  2n  or  2n+l  samples  we  attempt  to  construct 
W(z)  so  that  its  degree  is  less  than  or  equal  to  n and  it  is  therefore  appropriate  to 
study  Cn<  Since  4=2x2,  consider 


C2=  2 


where  the  sample  c^  is  unknown.  According  to  Theorem  1,  degree  W(z)  = 2 iff  c^  = 0 
and 
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1 2 

2 0 

rank 

= rank 

2 0 

0 0 

2 . (67) 

which  is  impossible.  Clearly,  degree  W(z)=  1 is  also  precluded  so  that  degree  W(z)  > 3. 
Consequently,  consider 


C3  = 


12  0 0 
2 0 0 c 

0 0 

0 


C4  c5 


C4  C5  C6 


(68) 


where  c^,  c^  and  c^  are  as  yet  undermined. 


According  to  Theorem  1,  degree  W(z)  = 3 is  achieved  iff 


detC3=  4(c^  - c4c6)  - c3  = 0 


and 


detC2=  -4c4^  0 ; det  H2  = -2c*  / 0 
Thus,  we  may  choose  any  c4  ^ 0 and  then  select  c,.  and  so  that 


3 2 
c.+4c.c,  -4c=0 

4 4 6 5 


(69) 


(70) 


(’I) 


The  particular  choice  c4=  4,  c^=0,  c^=  -4  yields  the  equation, 


1 

2 0 

0 

1 

2 

0 0 

4 

gl 

0 

0 4 

0 

g2 

0 

4 C 

-4 

- 

- 

L 3J 

(1, 

8 j > 8 

3)7  whose 

2 = 

II 

M9 

O 

g3  = 

- 

1 

7 

= «4 


(72) 


g<*>=  1 -7  - V 


(73) 

(74) 


The  polynomial  g(z)  possesses  the  three  simple  zeros, 


I 
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Z1  = 1 


'2,3 


From  Eq,  (18), 


and 


2 

z 


T 

z 

"T 


z + I 


zj~  + 2z  + 1 _ (zt  1)^ 
z3  + z - 2 (z-  l)(z3  + z + 2) 


1 

z - 1 


J-  . — 1 — + _i_  . — L_ 

«/7”  z z2  ji~  z ■ z2 


(75) 

(76) 

(77) 

(78) 


Thus , with  T = 1 , 

s j = In  1 = 0 , 

s2  3 = lnz2  3 = 0.34657  ± j 1.93216 


H(s) 


1^ 

s 


- — + -i- 

" s2  JT 


and 


h(t)  = 1 + 0.75593e°-34657tsin(1.932l6)t  . 


(79) 


(80) 


A check  shows  that  to  at  least  five  places,  h(0)  = 1,  h(l)  = 2 and  h(2)  = h(3)  = 0.  00001. 

It  is  easily  shown  that  the  most  general  denominator  g(z)  compatible  with  the  constraint 
c^  / 0 and  the  equality  in  Eq.  (68)  is  given  by 


g(z)  = 1 


2 

z 


(81) 


This  means  that  the  four  samples  Cq  = 1,  c^  = 2,  c2=  0 correspond  to  an  infinity 
of  different  pole  assignments  for  the  minimum  degree  H(s)  consistent  with  these 
samples . 

Nevertheless,  we  maintain  that  degree  H(s)  > 3 is  the  only  reasonable  conclusion 
that  can  be  drawn  from  the  given  data.  For  if  the  degree  of  the  "true"  H(s)  is  actually 
two,  say,  the  matrix 


I 
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L 


2 0 


0 0 


must  represent  a noise -corrupted  version  of  some 


where 

(2+  e j)  e3  - t\  i 0 . (84) 

In  other  words,  the  continuous  noise  components  n^,  n^,  n^  introduced  in  the  measure- 
ment of  h(l),  h(2)  and  h(3)  must  satisfy  the  equalities  n^  = -e^,  n^=  and  n^  = -e 
But  such  a triplet  constitutes  an  event  which  lies  on  the  zero-probability  surface, 

(2+  + nj)  (e3  + n3)  - (e2+  n2)2  = 0 (85) 

and  the  possibility  of  its  occurrence  must  be  discarded. 

Example  2.  Find  the  minimum-degree  H(s)  corresponding  to  the  eleven  noise- 
free  samples 

„ _ 17  1 7 

C0  " °-  ci  “ r*  > C2  4 ’ C 3 ' 16  ’ L4  1 

(86) 

_ 41  9 _ 33  . _ 97  _ 25 

c5  = IT  ’ c6  = 4 ’ c7  - IT  ’ c8  = 4’  c9  = TT  ’ c10  = T 


spaced  T = 1/ 4 secs,  apart. 

Solution.  Since  11  = 2x5+  1,  consider 


C5  = "T5 


97  25 

IT  ~4 
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Because  of  the  nonsingularity  of  C^,  degree  H(s)  > 5 (Theorem  1)  and  it  is  easily 
checked^  that 

C5C=  06  (88) 

where  Q = (1,  -3,  4,  -4,  3,  -1)\  Hence,  det  C^=  0.  In  addition,  direct  computation 
shows  that  det  H&  f 0 and  from  Theorem  1,  degree  H(s)  = degree  W(z)  = 5, 

The  polynomial  g(z)  is  given  by 

g(z)  = 1 -3z  + 4z2  -4z3  + 3z4  -z3  = (1  - z)3  (z2  + 1)  (89) 

and  H(s)  possesses  s ^ = 0 as  a pole  of  multiplicity  three  and  the  two  simple  complex 
conjugate  poles,  s^  ^ = i 4 lnj  = ± 2TTj.  Hence,  q = 3,  rrij  = 3 and  m-  “ = 1. 

From  the  recursions  in  Eq.  (18), 

f(z)  = (15  - 49z  + 47z2  - 17z3)  (90) 

whence  (all  details  are  omitted), 

W(z)  _ 15  - 49z  + 47  z2,  - 17z3  {91) 

z 16(1  - z)3(z2+  1) 


= iZ!  _ j/i  + _L  . 1 + * 

z+j  z-j  16  (Z_1)2  8 


Thus , 


J Z2,  . -J/2 + 2i__  + 

s + 2 tt  j s-2rrj  s 


d1*  A*1* 


AD  a(D  a(D 

2tt  A1  . A2  3 

s^  + ^nd  s s*  s3 


By  straight  substitution  of  Bj^  = 0,  = l/l®  anc^  - l/8  into  Eqs.  (52)  to  (54) 


we  get 


'The  data  cQ,  c1Q  has  been  generated  by  sampling 


h(t)  = t + sin  2TTt 
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a[1)  = 0 

A2n  = 4<T6  - = 0 


A ^ = 16B(3l)  = 2 


H(s)  = + 2 - Z TT  2 

s':  + (2n) 


(95) 


(96) 


and 

h(t)  = t2  + sin  2rrt  , 

which  agrees  with  footnote  (h), 

D.  The  Effect  of  Measurement  Noise 

In  essence,  the  theoretical  problem  of  identification  is  that  solving  an  equation 

A(x)  = y (97) 

for  x,  given  y and  the  operator  A.  If  the  inverse  A * exists,  the  solution  is  given  by 
the  rule 


x = A” 1 (y) 


(98) 


which  is  applicable  to  every  y in  the  range  of  A.  However,  due  to  the  inevitable 
presence  of  measurement  noise,  Eq.  (98)  may  not  constitute  a practical  solution. 

Irdeed,  suppose  that  y^  is  the  "true"  image  of  xn  and  that  because  of  noise  dis 


0 


turbance  d,  is  cor rupted  into  y = y^  + d.  Then,  any  scheme  used  to  implement  Eq. 
(98)  will  return  (in  exact  arithmetic), 


x = A " 1 (y'Q  + d) 
instead  of 

xo=  A’1(V 

and  it  is  therefore  important  to  assess  the  magnitude  of  the  error 
€=  Hx-xJI 


(99) 


(100) 


(101) 


in  some  appropriate  norm , ||  , ||  . In  particular,  it  is  clear  that  a stable  inversion 
procedure  must  be  such  that  e tends  to  zero  as  ||  d ||  ->0.  Or,  stated  differently,  the 


identification  problem  is  well-posed  if  the  inverse  operator  A ^ is  continuous. 

Theorem  4.  Let  the  total  number  of  poles  n of  a transfer  function  H(s)  of  type 
(1)  be  known  in  advance.  Then,  if  h(t)»H(s),  the  problem  of  restoring  h(t)  from  2n  + 1 
of  its  samples  c^=h(kT),  k=0-*2n,  is  well-posed  if  the  sampling  rate  l/T  is  high 
enough  to  preclude  folding. 

Proof.  According  to  the  corollary  to  Theorem  1,  in  the  absence  of  noise  the 
number  of  distinct  z^'s,  q say,  and  their  respective  multiplicities  m.,  i=  1 -*  q,  are 
determined  as  the  zeros  of  the  denominator  determinant  in  Equation  (31).  From  the 
z.'s  we  obtain  the  q distinct  exponents  s.  = ^lnz.,  i = l-+q,  and  it  follows  that  the  only 
unknowns  left  in  h(t)  are  the  mt  + m2  + . . . + m^  = n coefficients  of  Pj(t),  P2(t),  • • • , Pq(t). 

In  turn,  the  vector  of  these  n coefficients  is  determined  as  the  solution  of  the 
nxn  linear  system, 

^ s . kT 

ck=  £ Pi^k  T)e  1 , k = 0 -»  n - 1 (102) 

i=l 

Moreover,  this  solution  is  unique  because  the  system  determinant  is  a generalized 
Vandermonde  which  cannot  vanish. 

Let  us  continue  to  use  the  zeros  of  the  denominator  determinant  in  Eq.  (31)  as 
our  estimates  for  the  z^s  even  in  the  presence  of  noise.  By  continuity,  as  the  noise 
becomes  evanescent,  these  roots  coalesce  into  distinct  non  over  lapping  clusters.  The 
number  of  such  clusters  yields  q and  a count  of  the  number  of  z^s  in  the  i—  cluster 
yields  rm,  i=  1 -» q . 

As  an  approximation  to  the  noise -free  z.  we  choose  the  center  of  gravity  Q.  of 
th  * 

all  the  roots  in  the  i — cluster.  Clearly,  as  the  noise  goes  to  zero,  Q ^-»  (zj)true> 
i=l-*q,  an<^  cjQ"+ (cjc)true*  k = 0 -»  n - 1 . Consequently,  the  solution  of  Eq.  (102)  tends 
to  the  true  coefficients  and  the  impulse  response  calculated  by  means  of  this  algorithm 
has  the  true  h(t)  as  its  limit,  Q.E.D. 

Comment  1.  The  technique  employed  in  the  proof  of  Theorem  4 bypasses  the 
need  to  determine  f(z)  and  the  partial  fraction  expansion  of  f(z)/g(z).  Note  however, 
that  in  the  solution  of  Eq.  (102)  only  n of  the  2n  + I samples  are  used,  and  the  question 
of  which  set  of  n to  choose  is  left  unanswered.  At  this  point  it  becomes  necessary  to 
take  the  statistical  properties  of  the  noise  into  account. 

Comment  2.  Unlike  several  other  methods,  Prony's  approach  has  the  great  merit 
of  exploiting  the  finite -dimensional  nature  of  the  problem  from  the  outset  and  it  is  this 
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that  underlies  the  validity  of  Theorem  4.  Nevertheless,  the  algorithm  can  exhibit 
severe  ill-conditioning,  especially  in  the  presence  of  multiple  poles.  The  availability 
of  the  explicit  formulas  (45)  to  (48)  derived  in  Theorem  3 should  help  relieve  the 
situation. 

Comment  3.  The  original  work  by  Prony  was  published  in  1795,  the  same  year 
Gauss  presented  his  theory  of  least  squares  estimation.  As  we  have  seen,  the  tech- 
nique is  ideally  suited  for  modal  analysis  and  in  the  hands  of  Van  Blaricum  and  his  co- 
7 8 

workers,  ’ has  proven  itself  to  be  an  effective  diagnostic  tool  for  exploring  the  com- 
plex resonances  of  a system.  It  continues  to  stimulate  researchers  in  many  different 
fields. 

E,  Appendix 

For  any  two  nonnegative  integers  k and  f let  a.  . denote  the  coefficient  of  x*  in 

K,  x 

the  power  series  expansion  of 


a(x)  = (1  - eX)k 

about  x=  0.  By  the  Binomial  theorem, 


(103) 


k 

s 

r=0 


a(x)  = y;  (-l)rkCrerx 


whence. 


1 


k 

V 

Li 

r=0 


V , , , r k _ f 1 „ 

L crr  = TV  sk,f  • 


M 

However,  since 

oc  r 

x v x 

e = L FT 

r=0  • 

Equation  (103)  is  also  expressible  as 


a(x)  = (-x)k(l  + jr  + jr  + • • • 


(104) 


(105) 


Thus , 


ak ,1  Sk,  I = 0 


I < k 


(106) 


(107) 


and 
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Sk,l  =<-1)kiIbk,i-k 


t > k 


(108) 


where  ^ is  the  coefficient  of  xl  in  the  expansion  of 

2 L. 

,,  . x x .k 


(109) 


But 


(1  + IT  + JT  + 


Ok=  i + |x+!^i-LIx2+  4^LLx3  + 


+ ^—5-  (15k3  + 30k2  + 5k-2)x4  + 

10(41 ) 


+ ^—5-  (3k3  + 10k2+  5k  - 2)x5  + 

20(41 ) 

+ — x-(63k5+  315k4+  315k3  - 91k2  -42k  + 16)x6  + 

7! (41 )^ 


(HO) 


For  possible  future  reference  we  can  therefore  generate  the  list  S^  ^+1-’  r = 0 -»  8: 


5k.k  = <“1> 


kk. 

k# • 1 \ 1 k 


sk,k+i  -(-i)  (k+o: f . 

Sk,k+2  = (-1)k(k+2>:  jr  (3k+  1)  , 

Sk,  k+3  =(-Dk(k+3):  y^rtk+l). 

S.  k+4  = (-Dk(k+4)!  — T (15k3+  30k2  + 5k  - 2) 

k,k+4  10(4!  )2 


(111) 

(112) 

(113) 

(114) 

(115) 


Sk  k+5  = (-DNk+S):  — 2'  (3k3  + 10k2  + 5k -2) 

K,K+S  20(41  r 


k k4.A  = ("l)k(k+  6)'.  — , (63k5  + 315k4+  31  5k3  - 91k2  - 42k+ 1 6)  , (117) 

K»Kt°  7 I {4I  J4 


(116) 


SYSTEMS,  CONTROL  AND  NETWORKS 


487 


I 


Sk,k+7  =(-1>k(k+7)!  3T4161  (9k6'1'  65r5+  105k4 -7k3  - 4074k2  + 12112k-  8064)  , 

(118) 

Sk  k+x  = (-Uk(k+8): - c-(135k7+  1260k6+3150k5+840k4-2345k3  + 

’ 5191(2!) 

540k2 + 404k  - 144)  . (119) 

With  the  aid  of  formulas  (111)  to  (119)  it  is  a straightforward  matter  to  program 

Eq.  (48)  for  multiplicities  rm  < 9. 

SCEEE 

Post-Doctoral  Program  D.  C.  Youla 
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ON  ORDER  DETERMINATION  OF  UNEAR  AR  MODELS 
F.  Nakajima  and  F.  Kozin 

A.  Introduction 

We  are  often  faced  with  the  problem  of  estimating  parameters  and  the  order  of 
a model  that  will  fit  a given  set  of  observations.  Akaike1  established  a procedure, 
which  estimates  order  as  well  as  unkown  parameters,  by  introducing  the  mean  log- 
likelihood  as  a measure  of  fit  and  by  maximizing  this  measure  of  fit.  However, 
Akaike's  formulation  requires  the  knowledge  of  the  probability  law  of  the  observations. 
Following  our  approach  to  characterization  of  consistent  estimators,2  in  this  report, 
we  shall  define  a criterion  as  a measure  of  lit  which  can  be  formulated  without  know- 
ledge of  the  probability  law  of  observations  and  which  has  a close  relation  with  the 
formulation  for  the  parameter  estimation  problem  established  in  our  companion  re- 
port on  "Characterization  of  Consistent  Estimates."  Using  this  criterion,  we  shall 
obtain  a procedure  to  estimate  the  order  and  the  parameters  present  in  linear  time- 
varying  AR  models. 

B.  A Criterion  for  Order  Determination 

We  assume  that  the  given  data  {y  } are  generated  by  the  following  linear  AR 
model: 

Yt  + a,(t  - l)yt_!  + • • • + ag(t  -k)yt_o  = b{wt  (1) 

y.i  = y.z  = 0 


l 

*L«  •*  * • 

•.*«*  - ^ 

, ••  .v  •. 

- 


where  {wf  ] is  an  nxl  independent  sequence  with  zero  mean  and  variance  I and  {y  } is 

O 0-0  * 

the  nxl  output.  In  the  system  Eq.  (1),  {a.(t),  i = 1,  2 kj  and  k represent  unknown 

coefficients  and  unknown  system  order,  respectively. 

We  shall  represent  the  time-varying  coefficient  {a^(t)  } as 
m 

a.  (t)  = V 
i'  ' U lj 

8=1 

for  each  i = 1,  2,  . . . , k where  {a. . } are  unknown  constant  parameters  and  [F.  .(t)  ] are 

q 

known  nxn  uniformly  bounded  matrices.  Furthermore,  we  assume  that  kc  {l,  2,  ...,  I }, 
where  I is  a possible  largest  order,  the  true  unknown  parameter  9.  =(all,...a,  , 

o o T l 1-11  lm 

a21  ’ * * * * afcm  * ® 0)  belongs  to  a compact  set  0in  R , L=f  xm , © ^ = 

= f®Ll  ll«Lll  £ c>  c < »}  and  0L  = (®L1,  9 


T 

L,  2*  • • • • ®L>  L>  = (all’  a12* 


a|m>* 


1 
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The  problem  of  model  fitting  is  often  formulated  as  one  cf  the  selection  of  the 
probability  density  function  (p.d.f.),  P(YN|0K>  based  on  observations  YN  given  0R 


- {y  , t=  0,  1 N-l  } and  0K  is  a parameter  restricted  to  the  space, 

= (0  , • • • , 0„  0,  . . . , 0)T  where  K=  kxi  , 1 < k<l  . We  call  k the  order  of 

K,  1 K,  K 


where  Y 
®K 


the  model.  To  discriminate  between  two  systems,  where  one  is  the  true  system  and 
the  other  is  the  estimated  model,  Akaike  introduced  the  Kullback-Leibler  mformation 
criterion  (KLC)  for  each  order  k defined  by  KLC(0L;0K;N)  = E {log  I*(  Y^  j 0^) - 
- log  P(Yn|0k)3.  Since  KLC  takes  the  value  zero  only  at  0K  = 0^  and  otherwise 
positive,  then  the  best  estimate  of  the  order  can  be  obtained  by  minimizing  KLC  with 
respect  to  the  order  ke{l,  2,  • . • , i}. 

In  this  report  we  shall  solve  the  problem  of  order  determination  without  know- 
ledge of  the  p.d.f.  of  Yn.  To  do  this,  we  choose  a functional  qN(6K)  of  YN>  0R  and  N 
which  satisfies  the  following  conditions 

(a-1) 


lim  E{q-,(0K)}  exists. 
N->°o 


(a-2)  lim  tq^l©^  " E {qN(0K>  31  - 0 W'P*** 
N-»°° 

(a-i)  lim  E{qN(0R)  -qN(0L)3  > 0 . 

N-*°° 

with  equality  iff  = 8^  . 


We  define  a criterion  for  discriminating  two  systems  as 

^®L:0K^  E*'qN(0K)  " qN(^L^ 

= lim  (qN(©K)  ' 9n(§l))  w.p.l 

N-»  oo 


(2) 


for  each  order  k.  The  above  criterion  i(8L;eK)  can  be  evaluated  without  knowledge  of 
the  p.  d.  f.  of  Y^. 

We  define  a compact  set  0*  in  RL  as  ©K  £ {0KI  H 0^ |i  <c.  c < or  3 for  each  order 
k and  let  us  define  a parameter  0R  in  ©K  as 

Min  ["  Hm  E { qM(0K)  3 1 = " 

in  is.  j N-»<» 


0Ke  0K  N_>” 
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Since  this  definition  of  0^.  implies  that  for  each  order  k,  1(0  : 0 ) > 1(8  : 0 ); 

O __  ^ q L K — L K 

then  1(6^:  0^)  indicates  the  best  fitting  measure  of  Q(0^)  for  Q(0 where  Q(0)  = 

lim  E{q.j(0)}  . This  fact  tells  us  that  if  we  can  evaluate  1(8.  :0-,)  for  each  order  k, 
N->oo  W E K. 

then  we  can  obtain  the  best  order  1c  such  that 

Min  1(8.  : 0„)  = 1(8.  : 0»)  , where  K = kxm  , 
l<k<l  K •“  “ 

o _ 

Our  final  aim  is  to  obtain  the  estimate  of  1(0^:  0K)  for  sufficiently  large  samples.  In 
order  to  do  this,  we  first  show  the  a.  s.  consistency  of  the  estimate  of  0^  for  each 
order  k. 

Assume  that  the  system  (1)  satisfies 
o o 

(b-1)  0le® l and  kc  t1.  2,  . . . , i ) . 


(b-2) 

(b-3) 

(b-4) 


bt  is  uniformly  bounded  and  non-singular  for  all  t. 
The  system  (1)  is  exponentially  stable. 


E ||  wt  ||  < c for  all  t > 0. 


(b-5) 


bt  is  a periodic  function  with  period  T , and  for  each  i=  1, 


. f 


{F^(t),  j=  1,  2,  . ..,  m}  are  linearly  independent  and  are  periodic 

functions  with  period  T . 

o 

We  note  that  the  periodicity  condition  (b-5)  may  be  relaxed  to  other  classes  of 

functions  for  which  lim  E{qM(0-.)]  exists. 

N->oc  W R‘ 

For  the  system  Eq.  (1)  with  conditions  (b-1)  to  (b-5)  it  suffices  to  choose  a 
quadratic  functional  9^(6^)  defined  as 


-N<v  ■ iUiFfW'rT 

t=0 


(3) 


.(M 

where  yj  = yt  + a^t  - 1)  yt  _ j + • • • + afc(t  ~k)yt.k  and  0Ke  ©K. 


Since  the  functional  satisfies  the  sufficient  condition  of  a.s.  consistent 

* Kf  _ * TJ 

estimate,  then  we  have  that  0^  0^  w.p.  1 as  N where  0^  is  defined  by 


i V 


N 


SYSTEMS,  CONTROL  AND  NETWORKS  491 

Note  that  9^(6^)  satisfies  the  conditions  (a-1)  to  (a-3)  given  above.  Applying  a central 
limit  theorem  for  martingales  by  Brown  and  Eagleson,  we  have 

• <»K  -5K>  ~N'0’  FfCK.N> 

where  N(0,  j-j  j^)  denotes  the  normal  distribution  with  zero  mean  and  variance 
R CK,N  and  CK,  N i®  *iven  bV 

CK, N = T qN(eK0  * 

ja  O 

From  the  above  two  results,  we  take  W,.  defined  below  as  an  estimate  of  1(0.  : 0 ) for 

^ L K 

sufficiently  large  N, 

= 2*®K  *0K^  CK,N*®K  "0K*  + qN*0K  * ' qN*0L* 
for  each  order  k.  Hence 

E(fKN}a-qA)  + qN<®K)  + W <*> 

where  the  expectation  is  taken  with  respect  to  the  asymptotic  distribution  of  oj'1, 

K 

o 

The  estimate  of  the  order  k is  obtained  from  Eq.  (4)  as  the  value  of  k,  that 
minimises 

qN(0Kl)  + TT  (5) 
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CHARACTERIZATION  OF  CONSISTENT  ESTIMATES 
F.  Nakajima  and  F.  Kozin 

A.  Introduction 

The  strong  consistency  of  parameter  estimators  has  been  studied  by  many  re- 
searchers in  recent  years.  Strong  consistency  results  have  been  established  for  Maxi- 

12  3 

mum  Likelihood  Estimates  (MLE),  ’ Least  Squares  Estimates  (LSE),  and  more 

4 5 

recently  for  Prediction  Error  Estimates  (PEE).  ' The  basic  characteristic  of  each 
of  these  estimates  is  that  they  are  defined  in  terms  of  an  extremum  value  of  some 
appropriate  function  of  the  observed  data  and  the  unknown  parameters.  The  strong 
consistency  results  that  are  presently  available,  require  conditions  on  the  appropriate 
functions  that  define  MLE,  LSE  and  PEE,  respectively.  Conditions  such  as  differenti- 
ability, with  respect  to  the  unknown  parameters,  existence  of  certain  limits,  etc.,  are 
usually  required. 

In  this  report  we  will  present  a reasonably  general  characterization  of  strong 
consistency  which  apparently  allows  us  to  treat  a broader  class  of  estimation  problems 
than  has  been  treated  before. 


We  establish  that  strong  consistency  is  basically  a question  of  limits  of  the  ex- 
tremal points  of  a suitable  sequence  of  functions  of  the  observations.  This  sequence 
of  functions  must  satisfy  certain  almost  sure  asymptotic  properties,  otherwise  they 
are  quite  arbitrary.  ^ 


B. 


In  this  brief  report  we  will  simply  state  the  main  theorem  and  some  results. 
Characterization  of  Consistent  Estimates 


Let  = {yk,  k=  0,  1,  . . . , N-l  ] be  a collection  of  observed  data  y^  generated 
by  some  system  which  is  parameterized  by  an  unknown  constant  parameter  8,  where 
8 lies  in  a finite  dimensional  space  Rn.  We  let  P(Yj^/8)  be  the  probability  measure  of 
Yjj  given  a true  parameter  8.  We  assume  that  8 belongs  to  a compact  set  © in  Rn. 


Then  we  have  the  following  theorem  which  establishes  the  convergence  properties 
of  the  estimates  of  8. 


Theorem  1. 

If  there  exists  a functional  of  Y^j,  8e©  and  N,  say  q^j(6),  which  satisfies  the 
following  conditions: 

,o 

(1)  q^(8)  is  measurable  with  respect  to  PJY^/e) 
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i 


(2)  inf  (q,  (0)  -q.(0)  converges  uniformly  on  © 
k>N  K K 

as  N -*  » , w.  p.  1 

o 

(3)  lim  inf  (q.  (0)  -q.  (0))  continuous  w,  p.  1 on  0 
N-»  <*>  k >N  K K 

(4)  lim  inf  (q.  (0)  - q.  (0))  > 0 , w.  p.  1 

N-*°ok>N 

»N 

with  equality  holding  iff  0cG,  where  G is  some  closed  subset  of  ©,  then,  0 , converges 

- N 

to  G w.p.  1 as  N tends  infinity,  where  0 is  defined  by 

Minq  (0)  = qN(9N) 

0e  © 

We  note  that  if  G contains  only  the  single  point  8,  then  0^  -*  0 with  probability  one,  and, 
hence,  the  estimate  is  strongly  consistent.  We  note  that  Theorem  1 holds  in  the  continu- 
ous time  observations,  Y^,  = {yt;  0 < t < T ),  the  limits  being  taken  as  T approaches 
infinity.  Furthermore,  under  suitable  conditions  Theorem  1 also  holds  in  the  mean. 

If  it  can  be  established  that  the  limit  function 

Q(0)  = Urn  qN(0) 

N-»« 


exist  with  probability  one,  then  conditions  (1)  to  (3)  of  the  theorem  are  simply  stated 
as, 

(1)  qN(0)  is  continuous  on  © for  each  N. 

(2)  lim  qM(0)  = Q(0)  uniformly  on  ©. 

N-»oo  w 

(3)  Q(0)  takes  its  minimum  value  on  the  set  GC®. 


C . Exam  pie  s 


As  one  of  the  possible  applications  of  the  main  theorem,  we  consider  the  system, 
A,.x,.  + SR+1  wk+1  , (1) 


k+ 1 


kk 


Vk  = Xk  + Vk 

with  initial  state  x , where  x,  , w.  , Y,  and  v,  are  nxl,  pxl,  nxl  and  nxl  vectors,  and 

q q O K K K.  K 

Ak>  Bk  are  time  varying  nxn,  nxp  matrices  which  include  unknown  parameters. 

Assume  that  £wk } , {v^}  and  xq  are  statistically  independent  of  each  other. 
Furthermore,  assume  that  {wkl  and  {vkl  are  independent  sequences  with  zero  mean 


• • -• 


1 


A 
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and  variance  I and  T , respectively,  and  xq  is  a random  variable  with  mean  x and 
variance  cq.  We  assume  that  the  following  conditions  hold  for  system  (Eq.  1): 

m 

o o o o o x 

(1)  A.  = > a.  A.  (k),  where  6,  = (a. a)  is  an  unknown  mxl 

k ii  a 1 m 

1=1 

constant  parameter  and  (A^k),  i=  1,  2,  ...,  ) are  all  uniformly 
bounded  matrices. 

(2)  The  state  system  (Eq.  1)  is  bibo  stable. 

N=  1 

(3)  lim  inf-^j-  V F.  > 0 (positive  definite) 

N-»oc  N JTn 


where 


Fk  = 


tr.  Aj  (k)  Aj(k),  ...»  tr.A/(k)  Am(k) 


1 \*V  **  l 

1 m 


tr.  Am(k)  A^(k),  ...,  tr.  Aj;(k)  A_(k) 


m m 


(4)  E||wk||8,  E||vk||^,  E ||  xq  ||  ^ are  all  uniformly  bounded  in  k. 

o o 

(5)  is  nonsingular  for  all  k and  is  parameterized  as  = BE^ 

o o 

where  B = (b^),  i,  j=  1,  2,  . • . , n is  an  unknown  constant  matrix  and 
E^  is  a uniformly  bounded  nxn  matrix  and  satisfies 
N-l 

D=“”riZ  ^ 

N-"00  k=0 

exists  and  is  positive  definite. 

Under  the  conditions  (1)  to  (5),  we  first  choose  a functional  9j^(9a)  given  by 
N-l 

qN  ^a*  =N  ^ [(yk+l  ‘ Akyk)  (Yk+l  ' Akyk*  “ tr’  r"  tr‘  Ak  AkF]  ’ 


then  q'  ;(8  ) satisfies  the  conditions  (1)  to  (3)  given  in  Theorem  1.  Hence  we  have  that 
w a -N  o o 

the  estimate  0 of  0 converges  to  0 w.  p,  1 as  N ■*  « . 

<1  3. 

(2) 

Next  we  choose  a functional  q^j  ' (0b)  as 


r 
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qN  )(0b)  " N -j  [*y 
k=0 


N-l 


k+1  yk+l 


T - BkB^ ) + (AR  T A^  - Ak  Yk  Y^  AR 


N 


)] 


and  establish  convergence  of  the  estimates  0,  of  the  magnitudes  of  the  unknown 
o b 

parameters  in  B. 
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A NONLINEAR  SERVO  FOR  LINEAR  SYSTEMS 
L.  Shaw  and  H.  Gambe 

This  report  describes  continuing  work  on  the  nonlinear  control  of  linear  svs- 
5 

terns  . Here,  the  nonlinear  regulator  ideas  introduced  in  Ref.  1 are  combined  with 
the  suggestion  in  Ref.  2 for  the  conversion  of  a regulator  controller  into  a servo  con- 
troller. The  horizon-based  servos  described  here  seem  to  be  superior  to  the  non- 
linear servos  in  Ref.  2 with  regard  to  quality  of  performance  (e.  g.  , less  overshoot 
for  the  same  rise-time.) 

Although  horizon-based  controllers  require  extensive  on-line  computation,  mi- 
croprocessor advances  are  markedly  reducing  that  disadvantage. 

The  use  of  state  observers  is  also  explored,  since  horizon-based  controllers 
are  inherently  of  state  feedback  form. 

A.  Introduction 

We  will  begin  with  a formulation  of  the  servo  configuration  in  which  a nonlinear 
state  regulator  can  be  used  to  speed  the  tracking  response.  Some  results  about  non- 
linear horizon-based  regulators  will  then  be  summarized.  Finally,  a few  simple 
numerical  examples  will  demonstrate  the  qualities  of  these  controllers. 

Methods  are  well-known  for  designing  linear  feedback  servos  to  control  the  out- 
puts of  linear  plants.  Here  we  consider  augmentations  of  such  a linear  controller  by 
a nonlinear  loop  which  speeds  the  response  when  tracking  errors  are  large. 

The  nonlinear  controllers  considered  here  are  those  previously  described  for 

1 3 

regulation  of  multivariable  linear  plants,  ’ and  they  can  be  viewed  as  providing  state- 
dependent  feedback  gains. 

Since  the  usual  servo  is  limited  to  operation  on  the  scalar  tracking  error  (ref- 
erence minus  output),  the  use  of  a state-dependent  nonlinear  controller  will  generally 
require  use  of  a state  observer  to  obtain  the  error-state  vector  which  is  to  be  driven 
toward  zero.  Even  in  the  rare  cases  where  all  states  of  the  plant  are  directly  ob- 
servable, direct  measurement  of  the  error-state  vector  will  be  possible  only  when 
tracking  step  changes  in  the  reference  signal.  When  tracking  higher  order  polynomial 
reference  signals,  derivatives  of  the  reference  signal  (which  are  not  directly  measur- 
able)are  essential  for  determination  of  the  error-state  vector. 

Figure  1 shows  the  usual  linear  servo  configuration  in  which  P(s)  may  include 
both  the  given  plant  and  a tandem  compensator.  If  the  denominator  of  P(s)(of  the  form 
s + a j s ...  an)  has  a^  = a^_  j • • • an  m+i  = 0 • then  P is  a type-m  system  which 
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can  follow  an  (m-1)  degree  polynomial  reference  signal  (r(t)  = ^ a.  t1"1)  with  zero 

steady-state  error  (e  (t)  — * 0 ).  This  kind  of  single-input,  single-output  system  can 
be  described  by  the  state  variable  differential  equations 


x = A x + b 


y = c'x 


(1) 


where 


A = 


0 -a 


b = 


(2) 


O « 

PIS) 

• 

k 

Fig.  1.  Feedback  servo  configuration. 

It  is  convenient  to  define  a reference  vector  £(t)  and  an  error-state  vector  e(t) 
as  follows 


r = 


e = r - x . 


(3) 


It  follows  that  the  last  (n-m)  elements  of  £ are  zero  and  that  £'£=  e , the  scalar 
tracking  error.  Moreover,  the  special  canonical  form  of  A used  in  Eq.  (2)  implies 

A £ = £.  • (4) 

Substitution  of  Eqs.  (3)  and  (4)  into  Eq.  (1)  yields  the  differential  equations 

£ = (A  - b £'  ) £ 

£ = A £ 

so  that  £ -»  0 if  A is  a stable  matrix. 


(5) 


498 


SYSTEMS,  CONTROL  AND  NETWORKS 


Figure  2 shows  how  the  foregoing  linear  servo  can  be  augmented  by  an  addi- 
tional nonlinear  state  reulator  in  order  to  effect  a faster  tracking  response.  This 
configuration  postulates  the  use  of  a dynamic  state  observer  which  has  knowledge  of 
the  first  component  of  £ and  the  nonlinear  control  input  u^ ^(e.)  . The  presence  of 
the  nonlinear  loop  changes  the  error-state  vector  equations  to 

e_  = A e + b u^  (e)  . (6) 


Fig.  2.  Servo  with  nonlinear  agumentation  loop. 


If  e were  to  equal  £ then  Eq..  (6)  would  be  in  the  form  of  a state  variable  reg- 
ulator problem  for  which  horizon-based  nonlinear  controllers  have  been  developed  in 
order  to  achieve  fast  reduction  of  "large"  initial  ^-vectors.  If  a compatible  observer 
is  used,  £-*•£,  so  that  use  of  Eq.  (6)  with  u^^  designed  assuming  £ a e will 
lead  to  an  asymptotically  stable  servo  for  which  e = e ^ — 0 faster  for  "large"  e . 

The  transient  response  of  this  configuration  can  be  studied  via  simulations. 

B.  Review  of  Regulator  Results 

References  1 and  3 describe  nonlinear  regulators  for  systems  of  the  form 
£ = A £ + Bu(£)  (7) 

with  (A,B)  controllable.  Those  regulators  are  based  on  linear  regulators  of  the  re- 
ceding horizon  or  virtual  horizon  type  in  which  the  horizon  parameter  acts  like  a 
scalar  measure  of  the  response  time.  The  nonlinear  versions  make  the  horizons 
state-dependent,  with  "large"  error  states  inducing  nearer  horizons  and  correspond- 
ingly faster  reduction  of  the  error  state  toward  zero. 

For  example,  the  linear  receding  horizon  controller  takes  the  form 
u(t)  = - R'1B'W"1(T)£  (t)  , 


(8) 
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where  R is  a positive  definite  matrix,  and  W(T)  is  the  positive  definite  solution  of 
the  linear  matrix  equation 


dW 

dT 


BR_1B'  - AW  - WA'  ; W(0)  = 0. 


(9) 


At  each  instant  of  time  t,  the  controller  in  Eq.  (8)  acts  to  drive  e_  to  £ at  a time 
(t  + T).  Choice  of  a smaller  horizon  parameter  T results  in  a system  which  regu- 
lates faster,  while  requiring  correspondingly  larger  control  energy.  (The  importance 
of  conserving  control  energy  can  be  introduced  by  the  choice  of  R.  ) 

The  nonlinear  receding  horizon  controller  makes  the  horizon  distance  a function 
of  the  state.  It  is  possible  to  make  this  T (e)  a decreasing  function  of  the  "size"  of  e 
in  such  a way  that  the  resulting  nonlinear  system  is  asymptotically  stable,  and,  as 
demonstrated  by  simulations,  the  system  responds  faster  to  larger  initial  e (0 ) . 

The  state-dependent  T (ej  measures  the  size  of  e_  via  a quadratic  form  which  is 
natural  for  the  system,  namely 

V(e,T)  = e'W_1(T)e  (10) 

It  can  be  shown  that  this  positive  definite  function  is  decreasing  in  T (for  fixed  e). 
Thus  the  described  T ( e)  relation  is  achieved  when  T is  defined  implicitly  by  the  con- 
straint 


V(e.T)  = F(T) 

for  a suitable  decreasing  function  F(T).  (See  Figure  3.) 


It  can  be  shown  that  if 

dF  dV 

dT  dT 


(11) 


(12) 


A 
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whenever  Eq.  (11)  is  satisfied  (i.  e.  , at  the  intersection  points  in  Fig.  3)  then  the  V in 
Eq.  (10)  is  a Lyapunov  function,  and  the  closed  loop  nonlinear  system  is  aymptotical- 
ly  stable.  The  references  discuss  methods  for  finding  suitable  F (T ) functions  which 
ensure  Equation  (12).  It  is  clear  that  if  F(T)  decreases  monotonically  from  infinity 

to  zero  as  T increases  from  T to  T , then  at  least  one  solution  (T  <T<  T ) 

m M m — M' 

to  Eq.  (11)  exists  for  each  e,  and  the  one  for  which  T is  largest  will  satisfy  the  sta- 
bility condition  Equation  (12). 

C.  Examples 

Since  analytical  solution  of  Eq.  (9)  is  difficult  except  for  system  of  low  dimen- 
sion, numerical  techniques  will  be  most  appropriate  for  finding  W(T)  and  solving  Eq. 
(11)  for  T(ej  at  each  instant  of  time.  While  development  of  such  programs  in  in  prog- 
ress, some  very  simple  examples  have  been  simulated  using  analytical  solutions  of 
Equation  (9). 

Referring  to  Eqs.  (2)  and  (5),  a "type  1"  plant  defined  by 


for  which  an  explicit  expression  for  W(T)  is  found  in  References  1 and  3. 

We  first  consider  tracking  step  changes  in  the  reference  input  when  the  plant 
states  can  be  measured  (no  observer).  Figures  4,  5 and  6 demonstrate  step  response 
properties  of  this  nonlinear  servo  when 

F(T)  = d2  ( 

for  various  step  amplitudes  and  various  values  of  d.  Also  shown  are  responses  of  the 

linear  servo  of  Fig.  1 and  the  linear  version  of  Fig.  2 when  T=T  = 1.  The  ability 

max 

of  the  nonlinear  controller  to  respond  relatively  faster  for  larger  step  amplitudes  is 
quite  evident.  Figure  6 shows  how  the  maximum  amplitude  of  the  nonlinear  control 
signal  increases  as  the  controller  is  made  "more  nonlinear.  " 


Since  the  tracking  of  higher  order  polynomial  reference  signals  will  require  the 
use  of  a state  observer,  it  is  of  interest  to  examine  this  same  step  response  example 
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when  an  observer  is  used.  With  the  observer  output  £ defined  by 
£ = F e tht  tbu 

we  can  choose  F = (A-h  c')  to  have  suitably  negative  eigenvalues  to  provide  fast  con- 
vergence of  the  observer  error  €_  = <e-i),  which  satisfies  the  differential  equation: 
£ = F c . 

(The  full-dimension  observer  has  been  used  because  it  has  been  noted  that  in  some 
control  problems  this  causes  less  performance  degradation  than  the  reduced  order 
observer.  ) 

The  effects  of  observer  convergence  rate  on  the  behavior  of  the  nonlinear  con- 
troller were  investigated  by  choosing 

1*1  - F | = (s  + 0)2 

for  several  values  of  0 . Figure  7 shows  the  unusual  results.  The  step  response  is 
more  sluggish  here,  as  compared  to  the  perfect  state  observation  case,  for  all  0 . 

As  p increases,  from  1 to  30  to  50,  the  response  becomes  slower  and  slower;  but 
changing  from  /3  = 50  to  /3  = 1 50  improves  the  speed  of  response! 

D.  Conclusion 


It  has  been  shown  that  a nonlinear,  horizon-based  regulation  can  be  used  to  aug- 
ment the  tracking  performance  of  a linear  servo;  more  work  is  needed  to  examine 
the  interrelationship  of  the  observer  convergence  to  a precise  estimate  of  the  error- 
state  vector,  and  the  servo  convergence  of  the  error-state  vector  to  zero. 
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EFFICIENT  DISCRETE  FOURIER  TRANSFORMATION  OF  REAL  VECTORS 
T . W.  Parsons 

Using  results  obtained  from  the  study  of  fast  convolution  algorithms,  S.  Winograd 
of  IBM  has  found  a discrete  Fourier  transform  (DFT)  algorithm*  which  is  more  efficient 
than  the  well-known  Cooley- Tukey  ( "FFT")  algorithm.  Efficiency  in  the  DFT  is  cus- 
tomarily measured  by  the  number  of  multiplications  required,  on  the  assumption  that 
multiplications  are  the  most  time-consuming  operations  and  that  therefore  they  account 
for  most  of  the  computation  time  required.  Where  the  FFT  requires  0(N  log^N)  real 
multiplications  to  transform  a complex  N-vector,  Winograd's  algorithm  requires  0(2N) 
real  multiplications. 

In  many  applications  of  the  DFT,  the  vector  to  be  transformed  is  real;  in  that 
case,  half  of  the  resulting  transform  is  redundant,  according  to  the  well-known  property 
of  such  transforms, 

Y(N-n)  = Y(n)  , (1) 

and  therefore  a certain  amount  of  unnecessary  computation  is  done.  In  spite  of  its  ef- 
ficiency, Winograd's  algorithm,  while  its  structure  lays  bare  the  symmetries  inherent 
in  the  transforms  of  real  vectors,  nevertheless  ends  up  carrying  the  usual  overhead  of 
redundant  values.  We  present  here  an  adaptation  of  Winograd's  algorithm  which  re- 
quires no  complex  arithmetic,  except  at  the  point  where  the  transform  vector  is  gen- 
erated or  (in  the  inverse  transform)  accepted,  and  which  does  not  compute  any  redundant 
values . 

We  write  the  DFT  if  an  N-vector  as  follows: 

Y = Wy  , (2) 

nk 

where  W is  an  NxN  matrix  whose  (n,k)  element  is  wN  and  w^  = exp(2irj/N).  Indexing 
is  zero- origin  throughout  and  the  exponent  is  computed  modulo  N. 

For  N a prime  or  a power  of  a prime,  Winograd's  algorithm  can  be  represented 
as  a factorization  of  the  W matrix, 

W = SC  T (3) 

In  this  factorization,  the  multiplications  are  confined  to  C,  a diagonal  matrix  of  approxi- 
mately N multipliers;  the  S and  T matrices  consist  only  of  0's  and  — 1 ' s and  thus 
merely  implement  output  and  input  additions  (and  subtractions)  of  the  elements.  Such 
factorizations  are  known,  and  set  forth  in  Ref.  1,  for  N C (2,  3,  4,  5. 7, 8,  9,  16}.  Be- 
cause of  their  limited  size,  these  factorizations  are  known  as  "small- N"  transforms. 

The  structures  of  all  of  these  factorizations  are  similar;  for  this  summary  we  will  take 
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the  case. 


S = 


N=5,  as  representative. 

~ 0 0 0 0 o" 

1 1 1 1-10 

11-110  1 
1 1-1-1  0-1 

1 1 1-1  10 


For  N = 5, 


(4) 


C = diag(l , - 1 . 25, . 559, j . 951 , j 1 . 54, - j . 36  3) 

T 1 1 1 T 

0 1111 
T 01-1-11 
0 1-1  1-1 

00-1  10 
0100-1 


(5) 

(6) 


Note  that  the  elements  of  C are  always  either  pure  real  or  pure  imaginary. 

If  it  is  known  that  y is  real,  then  the  elements  of  Y satisfy  Equation  (1).  It  is  not 
difficult  to  rewrite  S,  C and  T so  as  to  separate  the  real  and  imaginary  parts  of  Y and 
place  the  imaginary  parts  in  the  locations  otherwise  used  for  the  redundant  elements. 
Formally,  we  may  define  a real  vector  H such  that 

( Re[Y(n)]  , n < N/2 

H(n)  = ] (7) 

* Im[Y(N-n)]  otherwise 


The  elements  of  Y can  be  easily  recovered  from  H as  follows: 


Y(n)  = 


H(n) 

H(n)  + jH(N  - n) 
H(N - n)  - jH(n) 


N = 0,  N/2 
0 < N < N/2 
N/2  < N < N 


(8a) 


or 

Y=BH  , (8b) 

where  the  elements  of  B are  readily  determined  from  inspection  of  Equation  (8a).  Then 
we  may  write, 

H = S'  C'  Ty  , (9) 

where  S'  and  C'  are  modifications  of  the  S and  C matrices  of  Winograd,  and 

Y = BS'  C'  Ty  . (10) 

The  modified  matrices  are  found  as  follows.  For  S',  it  is  a matter  of  simple 
inspection  to  identify  those  portions  of  S which  contribute,  on  the  one  hand,  to  the  real 
and  imaginary  parts  of  Y and,  on  the  other,  to  the  n=0,  0<  n<  N/2,  n=N/2,  and 
N/2<  n<  N elements  of  Y.  Doing  so  leads  to  a partitioning  of  the  S matrix,  and  to 
form  S'  from  S,  we  need  only  drop  the  redundant  partitions.  For  example,  for  N = 5, 
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l 

0 

0 

0 

1 

-1 

0 

Finally,  C'  is  formed  from  C by  multiplying  its  imaginary  elements  by  - j. 


(ID 


(12) 


In  computing  the  DFT  of  a real  vector  using  the  factorization  Eq.  (10),  all  arith- 
metic is  real  until  the  final  multiplication  by  B.  As  can  be  seen  from  Eq.  (8),  however, 
multiplication  by  B consists  merely  of  assembling  the  elements  of  Y from  those  of  H; 
hence  we  have  for  computational  purposes  eliminated  all  complex  arithmetic. 

For  N large  and  composite,  a more  elaborate  process  is  required,  both  for 
Winograd's  algorithm  and  for  our  modification.  It  has  been  shown  by  Thomas^  that  if 
N can  be  factored  into  t mutually  prime  factors  Nj,  N^,  • • • , N^  , then  the  1-dimensional 
N-point  DFT  can  be  replaced  by  an  equivalent  f -dimensional  DFT  each  of  whose  di- 
mensions corresponds  to  one  of  the  factors  of  N.  In  the  Winograd  algorithm,  each 
dimension  is  transformed  using  a small- N transform  factored  as  in  Eq . (2);  because 
of  linearity,  the  various  steps  for  the  various  dimensions  can  be  handled  in  any  con- 
venient order,  and  in  Winograd's  algorithm  all  the  T- multiplications  are  done  first, 
then  all  the  C-multiplications,  etc. 

When  this  procedure  is  applied  to  real  vectors  using  Eq.  (1),  we  see  that  again 
all  arithmetic  required  is  real  until  the  final  multiplication  of  each  dimension  by  its 
proper  B matrix.  To  find  a way  to  implement  the  B- multiplications  without  requiring 
extensive  storage  of  complex  intermediate  values  takes  some  ingenuity.  Let1^  be  the 
l -dimensional  working  array  just  before  the  final  multiplications  by  the  B-matrices, 
and  let  Z by  the  t -dimensional  output  transform.  For  example,  for  t =2, 

Z = B,*T  B*  . (13) 

1m  2 

To  combine  these  multiplications  into  a single  step,  we  introduce  the  stacking  operator 
v(-),  defined  as  follows:  for  any  array  A,  v(A)  is  the  column  vector  formed  by  stack- 

4 

ing  the  columns  of  A.  It  can  easily  be  shown  that  using  the  stacking  operator,  Eq . 

(13)  can  be  rewritten 

v(Z)  = (Bj  x B^)  V(|T)  (14) 

where  Bj  xB^  is  the  Kronecker  product  of  Bj  and  B,.  (Although  Eq . (14)  applies  to 
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two  dimensions  only,  it  can  be  generalized  to  £ dimensions  by  induction  on  the  number 
of  dimensions  . ) 

Using  the  form  Eq.  (14),  we  can  combine  all  of  the  final  B-matrix  multiplications 
into  a single  step  with  a real  input  vector  v(^),  a complex  output  vector  v(Z),  and  no 
complex  storage  of  intermediate  results  other  than  a simple  accumulator. 

Space  does  not  permit  describing  the  implementation  of  Eq.  (14)  in  the  computer, 

but  we  may  observe,  first,  that  the  Kronecker  product  (B.  x B,  x ■ • ■ x B. ) does  not  have 

to  be  stored  in  memory;  its  elements  can  be  computed  as  required  from  the  elements 

of  the  B-matrices,  which  in  turn  can  be  found  as  needed  from  the  definition  Equation 

(8).  Second,  note  that  multidimensional  arrays  are  customarily  stored  in  computer 

memory  in  stacked  form  and  that  the  indices  required  to  single  out  an  element  of  are 

also  those  needed  to  address  the  corresponding  elements  of  the  constituent  B matrices. 

This  leads  naturally  to  a nested- loop  program  structure.  Finally,  by  using  nested 

loops  and  bypassing  all  inner  levels  of  nesting  whenever  a B-matrix  element  is  found  to 

be  zero,  we  can  reduce  the  N multiplications  implicit  in  the  definition  of  (B  x B,  x • • • 

f i * 

xBt)  to  somewhat  fewer  than  2 N additions.  A more  detailed  description  will  be  found 

in  Reference  5. 

The  inverse  transform  is  found  by  taking  the  conjugate  transpose  of  W;  this  re- 
verses the  order  of  the  factorization  in  Eq.  (10): 

-1  1 *lt  t# 

w =N  w =jrc'S'lB  . (15) 

i i 

jjg 

In  the  multidimensional  case,  the  B multiplications  now  come  first  and  can  be  imple- 
mented by  means  of  the  Kronecker-product  form  as  in  the  direct  transform. 

EE  Departmental  Research  T.  Parsons 
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SOME  COMMENTS  ON  IMPROVING  THE  EFFICIENCY  OF  LEAST  SQUARES 
ESTIMATION 

I.  Kadar  and  L.  Kurz 

It  is  well  known  that  least-squares  estimators  are  inefficient  when  pdf  of  the 
noise  is  heavy  tailed.  The  efficiency  can  be  improved,  however,  by  a batch-nonlinear- 
integer  rank  transformation  which  guarantees  asymptotic  normality  after  m -batch  pre- 
processing steps  (m  - 10)  and  robustizes  the  least-squares  estimators. 

Consider  the  relative  performance  of  the  least-squares  estimator,  b,  and  the 
best  linear  unbiased  estimator,  g of  g in  the  linear  model 

y = X g+V 

where  y,  which  is  nxl  and  X which  is  nxr  are  observed  and  the  nxl  noise  (error) 
vector  has  E(V)  = 0,  E[VV^]  = VarV  = I\  Assume  that  the  rank  (X)  = r,  rank  (T)  = n 
and  n > r.  Then  it  is  well-known*  that 

T-1T  * T-l  -1T-I 

b = (x1x)  xy  , g = (x  r x)  1 x 1 r y 

Var(b)  = (XTX)'1XTrX(XTX)'1,  Var(g)  =(XTr'1X)'1 

and  Var(b)  - Var(g)  is  non-negative  definite.  There  is  no  loss  in  generality  in  supposing 
that 

XTX  = [I] 

L Jrxr 

since  X is  given. 

The  question  arises  as  to  how  good  or  how  bad  is  b relative  to  g ? One  needs  a 
relative  measure  of  performance.  Since  X is  assumed  known,  but  Tis  known  rarely, 
it  is  of  interest  to  compute  a lower  bound  to  the  efficiency  measure  for  X fixed  and  all 
r in  some  class.  One  possible  measure  suggested  by  Bloomfield  and  Watson  is  the 
ratio  of  generalized  variances.  The  efficiency  of  b relative  to  g is  then 

|Var(g  | ( _ , 

Eff  = = {|xTrx|  |xT  r " 1 | } < 1 

| Var(b)  | J ~ 

where,  | • | is  the  determinant  of  the  matrix. 

2 

Conditions  for  which  Eff  attains  unity  are  well-known.  If  T is  o [i],  (identity 
matrix),  derived  from  an  i.  i . d population,  then  Eff  = 1.  This  suggests,  directly,  the 
use  of  a recursive  formulation  of  the  above  problem  via  robustized  stochastic  approxi- 

3 

mation  minimum  variance  least  squares  (SAMVLS)  assuming,  of  course,  that  multiple 
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observations  are  available  where  we  asymptotically  attain  Eff  = 1.  This  means  that 
having  formed  the  optimum  gain  matrix  or  an  appropriate  estimator  thereof,  asym- 
ptotically we  obtain  a robust  estimator  for  each  component  8,  -*  8 and  then 

'k  ' 

VSK11iV“<Vii>A“«  • »->.* 

where  V„A  are  the  diagonal  elements  of  r and  § is  the  best  linear  unbiased  estimator 

i<  1 

and  V-.  is  independent  of  the  underlying  CDF. 

it 

Once  robustness  is  guaranteed  (i.e.  b=  8)  we  are  now  at  liberty  to  choose  a better 
design  matrix,  X,  for  a particular  experiment.  This  is  the  subject  of  optimal  experi- 
ments, where  it  is  assumed  a priori  that  b=  § . Therefore,  we  have  extended  the 
theory  of  least  squares  by  the  introduction  of  robustness,  as  applied  to  optimal  designs. 

Joint  Services  Technical  Advisory  Committee 

F44620 -7 4-C -0056  I.  Kadar  and  L.  Kurz 

Grumman  Aerospace  Corporation 
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ADAPTIVE  FREQUENCY  DOMAIN  ESTIMATORS 
A.  Pupoulis 

A new  method  of  estimation  is  presented  combining  frequency  domain  techniques 
with  the  advantages  of  adaptive  and  recursive  filtering.  The  method  is  based  on  the 
representation  of  an  arbitrary  signal  in  terms  of  the  samples  F(t,mo>o)  of  its  "running 
Fourier  transform"  F(t,<«i)  and  it  assumes  no  prior  knowledge  of  any  statistics. 

The  algorithm  for  determining  F(t,mosQ)  is  a first  order  recursion.  The  filter 
weights  are  determined  adaptively  and  the  adaptation  algorithm  involves  either  a running 
estimate  of  the  required  statistics  or  an  instantaneous  version  of  a gradient-seeking 
LMS  search  leading  to  a generalization  of  the  Widrow  filter. 

The  study  includes  the  development  of  various  properties  of  running  transforms 
and  their  use  in  spectral  estimation. 

A.  Adaptive  and  Recursive  MS  Estimators 

Adaptive  filters  are  useful  because  they  require  no  knowledge  of  prior  statistics 

l 2 

and  they  can  be  used  to  process  stationary  and  non-stationary  signals.  ’ Recursion 
is  introduced  to  simplify  the  required  arithmetic  operations.  In  this  report,  we  develop 
a new  filter  that  combines  these  advantages  with  the  advantages  of  frequency  domain 
processing  techniques.  To  place  our  development  into  familiar  perspective,  we  discuss 
first,  with  some  originality,  two  known  methods  of  adaptive  filtering.  We  shall  use  as 
illustration  the  estimation  of  a discrete  process  y[n]  in  terms  of  the  output 
N-l 

y[n]  = Yj  akCn3  *[n-k]  = XT[n]  A[n]  (1) 

k=0 

of  a non-recursive,  time-varying  system  with  input  the  N samples  x[n-k3  of  the  data 
vector  X[n],  Our  objective  is  to  determine  the  N components  a^[n]  of  the  transfer 
vector  A[n]  so  as  to  minimize  in  some  sense  the  estimation  error 

e[n]  = y[n]  - y[n]  = y[n]  -XT[n]  A[n]  (2) 

1.  The  Adaptive  Wiener  Filter 

If  the  processes  x[n]  and  y[n]  are  jointly  stationary  and  the  optimality  criterion 
is  the  minimization  of  the  MS  error  E{e  [n]],  then  the  transfer  vector  A[n]  is  inde- 
pendent of  n and  is  found  by  solving  the  system 

R A = T (3) 


where 
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R = E{X[n]  XT[n]}  (4) 

is  the  correlation  matrix  of  the  data  vector  and 

T = E{X[n]y[n]}  (5) 

3 

is  the  cross-correlation  vector  between  the  data  and  the  signal  to  be  estimated.  The 
resulting  solution 

A=R_1r  (6) 

is,  of  course,  the  transfer  vector  of  the  Wiener  filter.  This  filter  is  optimum  but  it 
requires  prior  knowledge  of  the  matrix  R and  the  vector  T. 

In  the  absence  of  this  knowledge,  we  can  use  for  A[n]  the  solution  of  the  system 

R[n]  A[n]  = f [n]  (7) 

where 


R[n]  = (l-a)  £ akX[n-k]  XT[n-k]  (8) 

k=0 

n 

f[n]=(l-a  Yj  akX[n-k]  y[n-k]  (9) 

k=0 

and 

0<  a < 1 (10) 

It  can  be  shown  that  the  quantities  R[n]  and  T [n]  tend  to  R and  T,  respectively, 
as  n -*  oo  and  a -*  1 . Hence,  for  sufficiently  large  n and  for  a close  to  one  (see 
the  solution 

A[n]  = R 1 [n]  f [n]  (11) 

of  Eq.  (7)  is  nearly  optimum  as  in  Equation  (6). 

2.  The  Widrow  Filter 

4 5 

In  the  Widrow  filter,  ’ the  adaptation  algorithm  is  a first  order  recursion: 
A[n+l]  = A[n]  + |Je[n]X[n]  (12) 


where  u is  some  constant  that  determines  the  time  of  adaptation.  This  algorithm  is 
justified  as  an  instantaneous  version  of  the  gradient-seeking  method  for  minimizing 
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the  MS  estimation  error. 

The  Widrow  filter  is  simple;  it  requires  no  prior  knowledge  of  any  statistics;  it 
can  be  applied  to  stationary  and  non -stationary  processes.  It  has,  however,  a number 
of  disadvantages. 

The  solution  A[n]  of  Eq.  (12)  does  not  approach  a constant  as  n-»  oo.  The  vector 
A [n ] remains  random  for  any  n.  Various  attempts  have  been  made  to  analyze  Eq.  (12), 
however,  the  known  results  are  based  on  assumptions  that  are  not  generally  valid.5’** 
The  difficulty  is  that  Eq.  (12)  is  a time-varying  equation  with  random  coefficients. 
Indeed,  inserting  Eq.  (2)  into  Eq.  (12),  we  obtain 

A[n+  1 ] - (l  - |i  XT[n]  X[n])  A[n]  = p X[n]  y [n]  (13) 

Several  investigations  are  based  on  the  assumptions  that 

E{XT[n]  X[n]  A[n]  ] = E{XT[n]  X[n]  ] E{A[n]}  (14) 

If  this  holds,  then  the  mean  of  A[n}  tends  to  the  Wiener  optimum  Eq.  (6)  as  n ->oo. 
This  is  the  usua'  justification  of  using  Eq.  (12)  as  adaptation  algorithm.  One  of  the 
justifications  of  Eq.  (14)  that  appears  on  the  surface  to  be  not  too  restrictive,  is  the 
assumption  that,  all  processes  are  normal  with  zero  mean.*3  Whereas  the  normality 
assumption  for  the  data  is  not  too  restrictive,  it  does  not  hold  for  the  process  A[n] 
because  its  dependence  on  x[n]  and  y [n  ] is  strongly  nonlinear  as  we  see  from  Equation 
(17).  The  problem  of  determining  the  statistical  properties  of  the  solution  of  Eq . (13) 
has  been  investigated  for  various  special  cases,  * but  the  results  are  by  no  means 
conclusive.  Numerical  studies,  however,  have  shown  that,  in  a number  of  applications, 
the  filter  performs  well. 

3 • I he  Adaptive  F FT  F ilt  er 

In  the  preceding  filters,  the  number  of  weights  a,^[n]  that  are  adaptively  controlled 
equals  the  length  N of  the  filter  (see  Equation  (1)).  This  often  introduces  an  unnecessary 
complexity  in  the  design  if  the  required  value  of  N is  large.  We  mention,  as  a simple 
illustration,  the  adaptive  realization  of  a one-pole  system  with  the  filter  in  Equation  (1). 
In  this  case,  the  coefficients  a^[n]  must  approach  a geometric  progression  z ^ involving 
a single  parameter  z^.  For  a satisfactory  non-recursive  approximation,  however,  the 
length  N of  the  filter  must  be  large  if  zq  is  close  to  one. 

To  overcome  this  problem  we  impose  the  restriction  that  the  N weights  ak[n] 
linearly  dependent  on  M parameters  b [n] 


are 
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t>i  J 


A[n]=W3[n]  (15) 

In  the  above,  B[nl  is  an  M -vector  with  elements  b [nl  and  W is  an  N by  M 
J o m J ’ o 

matrix  to  be  determined.  Inserting  Eq,  (1),  we  obtain 

y [n  ] = XT  [n  ] W B [n  ] = Z T [n  ] B [n  ] (16) 

whe  re 

Z[n]  = WTX[n]  (17) 

Equations  (16)  and  (17)  specify  the  system  of  Fig.  1 consisting  of  the  matrix  W 
transforming  the  N-vector  X[n]  into  the  MQ-vector  Z[n],  followed  by  the  weights  B[n] 
and  an  adder.  The  M^-veclor  B[n]  is  adaptively  controlled. 


Fig.  1. 

The  selection  of  the  matrix  W depends  on  the  properties  of  the  processes  x[n] 
and  y[n],  Suppo.se,  for  example,  that 

x[n]  = y [n]  + v [n]  (17) 

where  v[n]  is  noise,  the  vectors  X[n]  and  Y[n]  are  elements  of  an  N-dimensional 
space.  If  the  .signal  vector  Y[n]  belongs  to  a sub-space  S^  of  dimensionality  M^,  then 
we  choose  for  W the  projection  operator  into  S^. 

Another  consideration  in  the  selection  of  W is  the  nature  of  the  resulting  vector 
Z[n],  If  the  autocorrelation  matrix  of  Z[n]  is  diagonal  or  nearly  so,  then  the  evaluation 
of  the  inverse  of  its  sample  matrix  R_1[n]  used  in  the  adaptive  Wiener  filter  (see  Eq. 
(11))  is  considerably  simplified. 
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These  two  considerations,  small  M ; data  vector  Z[nl  with  uncorrelated  com- 

o 

ponents,  are  satisfied  if  we  use  for  W the  matrix  resulting  from  the  Karhunen-Loeve 
expansion  of  X[n],  The  coefficients  of  this  expansion  depend,  however,  on  the 
statistics  of  X[n]  which  we  assumed  unknown. 

In  our  search  for  a suitable  transformation,  we  are  guided  also  by  another  factor: 
the  spectral  properties  of  the  various  signals.  In  a number  of  cases,  filtering  is  best 
performed  in  the  frequency  domain  - elimination  of  high  frequency  noise,  or  low  fre- 
quency clutter,  for  example.  In  the  design  of  the  filter,  it  is  desirable  to  select  as 
adaptively  controlled  variables,  parameters  that  are  directly  related  to  the  frequency 
domain  characteristics  of  the  various  signals. 

To  meet  these  requirements,  we  select  as  W a matrix  whose  elements  are  the 
roots  of  unity: 

wkm  w = ej2n/N  0 < k < N- 1 -M<m<  M (19) 

where  the  index  m is  suitably  changed  and  Mq  = 2M+  1.  The  elements  of  the  resulting 
vector  Z[n]  are  given  by  (see  Eq.  (17)) 

N-l 

F[n,  m]=  Yj  x [n  - k]  wmk  (20) 

k=0 

o 

that  is,  they  are  the  discrete  Fourier  series  (abbreviation:  DFS)  of  the  N samples 
x [n  ] , x[n-l],...,  x [n  - N + 1 ] 

of  the  input  sequence  x[n]. 

With  this  choise  of  W,  the  output  of  the  filter  of  Fig.  1 is  given  by  (see  Eq.  (16)) 

M 

y[n]  = Y bm  [n]  F[n,  m]  |m  | < M (21) 

m=-M 

and  the  coefficients  [n]  are  adaptively  controlled. 

It  appears  from  Eq.  (20)  that,  for  the  determination  of  the  Mq  components 
F[n,  m]  of  the  vector  Z[n],  the  required  number  of  multiplications  equal  Mq  times  N. 
However,  as  we  show  in  Section  C,  each  F[n,  m]  can  be  computed  recursively  with 
one  multiplication  only. 

In  the  following,  we  3tudy  various  properties  and  extensions  of  the  Fourier  filter 
of  Figure  1.  The  results  are  based  on  the  properties  of  running  transforms  to  be 
developed  next. 
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B.  Running  Fourier  Tr ansforms 

c 

The  running  transform  F(t,u)  of  a signal  f(t)  is  the  integral 


F(t,«)  = Jf(t+ t)  e'-^dT  (; 

-c 

where  c is  a given  constant.  For  a fixed  t,  F(t,w)  is  the  Fourier  transform  in  the 
Variable  T of  the  segment  f(t+  t)  of  f(t)  shown  in  Figure  2. 


t-c  t t+c 


-c  0 C 


Fig.  2. 


We  shall  presently  show  that  the  response  g(t)  of  any  time-limited  system  can  be 

expressed  in  terms  of  the  samples  F(t,mu  ) of  F(t,w).  We  start  with  the  following 

9 10  ° 

version  of  the  sampling  theorem:  * 

1.  Inversion  Formula:  For  any  t. 


f(t)  = ^ l F(t<m«  ) 


Proof.  We  expand  the  function  f(t+  t)  into  a Fourier  series  in  the  variable  T.  The 
coefficients  of  the  expansion  in  the  interval  (-c,  c)  are  given  by  (see  Eq.  (4)) 


c 

1 r -jmw  T . 

yr  / f(t+  t)  e 0 dT  = F(t,mw  ) 


Hence,  for  | t | < c 


f(t+T)  =^-  y F(t,mwQ)e 


jmcj  t 
o 


and  Eq.  (23)  results  with  T = 0. 


r 
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2«  Eiltering . We  next  show  that  if  the  impulse  response  h(t)  of  a linear  system  is 
time-limited: 


h(t)  = 0 for  1 1 1 > c 
then  its  response 
c 


g(t)  = f f(t-a)  h(a)da 

-C 

is  given  by 

00 

g(l)  = Ic  E F(t*mu0)  H(m«o) 

m=-oo 

where 

c 

H(«)  = / h(t)  e'jwtdt 


(26) 


(27) 


(28) 


(29) 


Proof.  Inserting  Eq.  (25)  into  Eq.  (27)  and  integrating  term-wise,  we  obtain 
Equation  (28). 

Varying  the  number  of  terms  in  Eq.  (28)  or  the  values  of  the  coefficients  H(m«o), 
we  can  design  filters  whose  frequency  response  is  adaptively  controlled.  We  shall 
elaborate  in  the  development  of  the  discrete  version  of  Equation  (28).  We  note  next  an 
extension  of  the  familiar  smoothing  method  used  to  estimate  a signal  y(t)  in  terms  of 
the  data 


(30) 


f(t)  = y(t)  + v(t) 

containing  the  noise  process  v(t). 

3.  Running  Low-Pass  Filters:  Suppose  that 
1 for  | m | < M 

0 for  | m | > M 

In  this  case,  Eq.  (28)  yields 

M 

g(t)  = fM(t)  = ^ £ F(t,m«o)  =^F(t,0)  + i Yj  ReF(t,mwo)  (32) 


H(muo)  = 


(31) 


m=-M 


M 

l 

m = 1 
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The  first  term  above  is  the  average  of  f(t)  in  the  interval  (-c,  c): 


fo(t)  = ^F(t’0)  = h S £<t+T>dT 


(33) 


If  fQ(t)  is  used  as  the  estimate  of  y(t),  then  the  variance  of  the  estimate  decreases 
with  increasing  c - it  equals  l/2c  if  the  autocorrelation  of  the  noise  equals  I6(t).  How- 
ever, its  bias,  due  to  the  smoothing  of  y(t),  increases.  For  an  optimum  estimation  for 
each  t,  the  length  2c  of  the  smoothing  interval  2c  must  be  adaptively  varied,  As  we 
show  next,  the  effective  length  of  the  smoothing  interval  can  be  controlled  by  the 
number  M in  Equation  (32). 


From  Eqs.  (22)  and  (32)  it  follows  that 

M c n c 


M 


i _ „ jmw  a i *'*  imu  a 

g(t)  = 2T  Z /£(t-a)e  ° da  = J f(t-a)  Z e ° da 

m=-M  -c  -c  m=-M 

This  shows  that  the  impulse  response  h(t)  of  the  running  low-pass  filter  is  given 


by 


pcw  “ 

- -h~  E 

m=-M 


jmwQt  sin(M  + i)wot 

e ° = p (t) 

c o»„t 

2c  sin 


(34) 


where 


P„(t)  = 


IM  < c 

1 1 1 > c 


The  transform  of  h(t)  yields  the  corresponding  frequency  response: 

M sin  c(u  -mu  ) 

HM  = Z 


m=-M 


c(w-mw  ) 


(35) 


In  Fig.  3,  we  plot  h(t)  and  H(w)  for  fixed  c and  for  M from  zero  to  five.  The 
effective  duration  of  h(t)  (main  lobe)  equals 


2tt 


2c 


4c  = (lM+ 1)w  = 2M  + T 


(36) 


and  it  decreases  with  increasing  M.  As  we  see  from  the  figure,  H(w)  is  essentially  a 
low-pas  3 filter  even  for  moderate  values  of  M and  its  cut-off  frequency  equals 


wc  * <M+i>w0  = rL 


(37) 
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The  running  Fourier  filter  is  an  extension  of  Eq.  (32)  and  it  is  obtained  if  the 
sum  in  Eq.  (23)  is  suitably  truncated.  Its  response  equals  the  sum 


M 

±-  l 

Zc  Lt 

m=-M 


Fft.mui^)  H(m«u  ^) 


where  H(ma>u)  are  2M  + 1 weights  that  a-e  adaptively  controlled  to  yield  estimates  of 
various  signals.  In  the  next  section,  we  develop  the  digital  implementation  of  the 
filter  and  the  adaptation  algorithms  for  determining  H(mwQ).  The  size  of  the  sampling 
interval  and  the  FFT  order  N are  dictated  by  well  known  factors.  The  value  of  the 
constant  M depends  on  'he  statistical  properties  of  the  signals  under  consideration. 

C.  Running  Discrete  Fourier  Series 

9 12 

The  running  DFS  F ^n,m  7 of  a discrete  signal  f [n]  is  the  sum7’ 


N-i 

F[n,  m]  - ^ f[n-k3 


j2n/N 


where  N is  a given  integer  (see  also  Eq.  (20)).  For  a fixed  n,  F[n,  m]  is  the  DFS  in 
the  -artable  k >f  the  segment  f[n-k'  of  f^n]  shown  in  Figure  5. 


n-N*i  n 


k N-I  0 


Fig.  5. 


Inversion  Formula:  For  any  n, 


ftn]  = jj  ^ F[n,  ni] 


Proof.  From  the  inversion  form’  la  'of  DFS  it  follows  that 


f[n-k]s-»r  ^ F"[n,  m]  w * 


and  Eq.  (40)  results  with  k = 0. 
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Filtering:  Suppose  that  f[n]  is  the  input  to  a non-recursive  filter  of  length  N. 


The  resulting  output  is  given  by 
N-l 

8 [n]  = £ f [n  - k]  h[k] 

k=0 


13 


(42) 


We  maintain  that 


14 


N-l 

^ Ft",  rn ] H(wm) 
m=0 


(43) 


whe  r s 

N-l 

m*)  = y h[k]  z~k 
k=0 

is  the  system  function. 


(44) 


Inserting  Eq.  (41)  into  Eq.  (42)  and  changing  the  summation  order,  we 
obtain  Equation  (43). 

Running  Discrete  Low-Pass  Filters:  If 

1 | m | < M 

0 | m | > M 


H(wm) 


then  Eq.  (43)  yields 

M 


M 


8 n = ft"  ^ F[n,m]=|^-F[n,  0]  + ^-^ReF[n,m] 

m=-M  m = l 

From  the  above  and  Eq.  (41)  it  follows  that 


g[n]  = — r 


M N-l  N-l  M 

IT  I Tj  f[n-k]wkm=i-  V f [n  - k ] £ 


km 


m=-M  k=0  k=0  m = -M 

Hence,  the  delta  response  of  the  running  low-pass  filter  equals 


M 


i,r«i  . 1 V nm 

hlnj  = w 


m = -M 


N sin 


Tin 

TT 


(45) 


(46) 


(47) 
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for  n between  0 and  N-l  and  it  equals  zero  otherwise. 

The  running  discrete  Fourier  filter  is  an  extension  of  Eq.  (47)  and  it  is  obtained 
by  truncating  the  sum  in  Eq.  (43)  (see  also  Eq.  (21)): 

M 

g[n]=-j^-  £ F[n,  m]  H(wm)  (48) 

m=-M 

The  design  of  this  filter  involves  the  selection  of  the  integer  M,  the  determination 
of  the  weights  H(wm)  and  the  computation  of  the  sequence  F[n,  m].  We  start  with  the 
last  objective. 


The  DFS  Analyzer 


The  number  of  multiplications  needed  to  compute  the  running  DFS  F[n,  m]  from 
Eq.  (41)  equals  N-l.  This  number  can  be  reduced  to  a single  multiplication  if  F[n,  m] 
is  computed  recursively.  Indeed,  from  Eq.  (41)  it  follows  that 


F [n,  m ] - w‘n  F[n-1,  m]  = f[n]  - f[n-N] 


(49) 


Hence,  F[n,  m]  can  be  determined  as  the  output  of  the  first  order  system^  shown  in 
Figure  6(a). 


Fig.  6(a). 

This  system  has  a pole  wm  on  the  unit  circle.  Furthermore,  it  requires  a shift 
register  or  some  other  method  for  generating  the  delayed  input  f[n-N].  We  discuss 
below  a modification  of  Eq.  (49)  that  avoids  these  difficulties. 

The  output  F^tn,  m ] of  the  system  of  Fig.  6(b)  satisfies  the  first  order  recursion 
Fa  [n,  m ] - a wm  Fa[n,  m ] = f [n]  (50) 

Solving,  we  obtain 

00 

Fa  [n,  m]  = Yj  w'm  f[n  - i]  (51) 

i=0 
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Fig.  6(b). 

Withi=k+rN,  the  above  yields 
N-l  oo 

Fjn.nO  = l w1™  l ak+rNf[n-k-rN] 
k=0  r=-oo 


(52) 


This  shows  that  F^tn,  m]  is  the  DFS  in  the  variable  k of  the  last  sum.  Hence 
(inversion  formula) 


2 aktrN([„.k.rNl  = ^ 

r=-oo 


Y Fa  [n,  m]  w-Krn 


(53) 


We  now  assume  that 
N 

a « 1 (54) 

With  this  assumption,  Eqs.  (52)  and  (53)  yield 
N-l 

Fjn,  m]  = Yj  Otkf[n-k]wkm  (55) 

k=0 

N-l 

akf[n  - k]  =-^j-  Y Fa  Cn.  m 1 w-1^  (56) 

m=0 

We  shall  now  express  the  output  g[n]  of  a non-recursive  filter  of  length  N in 
terms  of  F [n,  nti].  From  Eq.  (42)  it  follows  that 
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N-l 

g[n]=  Yj  akf  [n  - k]  a“k  h[k] 
k=0 

Hence  (see  Eq.  (56)), 

N-l  N-l 

g[n]=£f-  Y Fatn,  m]  £ h[k]a_kw‘‘Cm 
m =0  k=0 

But  the  last  sum  equals  H(ttwm)  (see  Eq.  (44)).  Therefore, 

N-l 

g[n]=  Y Fa^n’  ml  H(awm)  (57) 

m = 0 

This  is  the  modified  form  of  Eq.  (43)  and  it  holds  under  the  constraint  Equation  (54). 

The  resulting  filter  preserves  the  frequency  domain  properties  of  the  weights  H(awm) 
if  a is  close  to  one.  This  condition  is  compatible  with  the  constraint  Eq.  (54)  if  N is 
sufficiently  large. 

Truncating  Eq.  (61),  we  obtain 
M 

g[n]=j^-  Y Fa  Cn,  m]  H(a  wm)  (58) 

m=-M 

Equations  (48)  and  (58)  specify  the  running  frequency  domain  filter.  Its  implimen- 
tation  is  shown  in  Figure  7.  It  consists  of  2M  + 1 analyzers  as  in  Fig.  6,  followed  by 
2M+  1 weights  H(wm)  or  H(awm)  or  H(awrn),  and  an  adder. 

In  the  notations  of  Section  A,  g[n]  equals  the  estimate  y[n]  of  a signal  y[n]  and 
f[n]  equals  the  data  sequence  x[n].  We  give  next  the  recursive  determination  of  the 
unknown  weights  limiting  the  discussion  to  the  sum  in  Equation  (48).  The  results  are 
similar  for  Equation  (58). 

The  Frequency  Domain  Wiener  Filter 

We  denote  B [n]  a vector  whose  2M+  1 components  b^  [n]  are  the  adaptively 
controlled  values  of  the  weights  H(wm).  We  wish  to  determine  these  components  such 
that,  if  the  sum 

M 

ytn]  = Y bm[n]F[n,  m] 
m=-M 


(59) 
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I 


I 

I 


Fig.  7. 

is  used  to  estimate  a signal  y[n],  then  the  MS  value  of  the  resulting  error 
e [n  ] = y [n  ] - y [n  ] 

is  minimum.  In  Eq.  (53),  F [n,  m]  is  the  running  DFS  of  the  data  x[n]. 

If  the  processes  x[n]  and  y[n]  are  jointly  stationary,  with  known  second  order 

moments,  then  the  optimum  vector  B [n]  is  independent  of  n and  it  is  given  by  (ortho- 
3 

gonality  principle) 

B = R-1  r (60) 

as  in  Equation  (6).  In  the  above,  R is  a matrix  with  elements 

E{F[n,  m 1 F [n,  m ]}  (61) 

and  F is  a vector  with  elements 

E{F[n,  m]  y[n]  } (62) 

As  we  show  in  the  section,  these  quantities  can  be  expressed  in  terms  of  the 
moments  of  x[n]  and  y[n].  If  these  moments  are  not  known,  then  the  vector  B[n] 
can  be  determined  adaptively  as  in  Equation  (11). 


526 


IMAGE  PROCESSING 


The  Frequency  Domain  Widrow  Filter 

A simplified  version  of  the  adaptation  algorithm  is  the  gradient  seeking  recursion 
bm[n+ 1]  = bm[n]  + Um  e[n]  Ffn,  m]  (63) 

This  is  a modified  form  of  Eq.  (12)  where  we  use  a different  constant  u for 

m 

each  m.  We  do  so  because  in  many  cases,  it  is  desirable  to  have  a different  adapta- 

1 5 

tion  time  for  each  frequency  component  (speech  processing,  for  example).  This 
generalization  could  have  been  applied  also  in  Equation  (12).  However,  there  is  no 
obvious  advantage  to  a different  adaptation  time  for  each  term  x[n-k]  of  the  sum  in 
Equation  (1). 
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BOUND  ON  THE  PROBABILITY  OF  MISCLASSIFICATION  OF  TWO-DIMENSIONAL 
IMAGES 

L.  Kurz  and  P.  Legakis 

Recently  the  authors  introduced  an  efficient  statistical  approach  to  image  classi- 
fication. As  a measure  of  performance  of  the  procedure,  asymptotic  analysis  and 

simulations  were  used.  In  this  report,  a bound  on  the  probability  of  image  misclassi- 

2 

fication  basedon  the  two-dimensional  Chebyshev  inequality  is  introduced.  The  para- 
meters in  the  inequality  are  obtained  from  appropriate  simulation  studies. 


A.  The  Bivariate  Chebyshev  System 


Marshall  and  Olkin  have  shown  that  if  x and  y are  two  random  variables  with 
known  first  and  second  order  moments,  and  if  R(x,  y)  is  a non-negative  risk  function, 
then  E[R(x,  y)]  can  be  bounded  in  terms  of  the  first  and  second  order  moments.  This 
bivariate  Chebyshev  system  is  useful  if  one  wishes  to  establish  a lower  bound  on  the 
probability  of  correct  classification.  To  relate  the  Chebyshev  system  to  the  problem 
at  hand  with  the  help  of  Fig.  1,  let 


R^j  Prob  [(U,S)  e Region  H given  that  H is  true] 

R^.  Prob  [(U,S)  e Region  K given  that  K is  true] 

S The  signal  space 

U The  statistic  space 


then 


Prob  [correct  classification]  > y (R^  + R^,) 


Fig.  1.  Decision  space  for  Chebyshev  bound. 


under  H but  subscripts  are  omitted  for  notational  simplicity.  Following  Marshall  and 
Olkin,  R(x,  y)  must  be  quadratic,  possibly  with  linear  terms,  so  that 

2 2 

R(x,  y)  = ax  + by  + cxy  + dx  + ey  (4) 

subject  to 

a.  R(x,  y)  > 1,  for  all  x,  y exterior  to  Region  H 

b.  R(x,  y)  > 0,  otherwise  (5) 

Then 

00 

E[R(x,  y)  ] = f J R(x,  y)  f(x,  y)  dx  dy 
- oo 

- J I R<x*  y)  f<x>  dxdy 

(x,  y)  € (Region  H) 

> 1 ~ j f f(x,  y)  dxdy  (6) 

(x,  y)  e (Region  H) 

and 

Rpj  = Prob[(x,  y)  e Region  H]  = J J f(x,  y)  dxdy  > 1 - E[R(x,y)] 

(x,  t)  e (Region  H)  (7) 

To  maximize  the  probability  of  correct  classification,  R^j  must  be  made  as  large 
as  possible  subject  to  constraints  in  Equation  (5).  This  is  equivalent  to  maximizing 
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E[R(x,  y)]  subject  to  the  same  constraints.  The  overall  procedure  is  further  simpli- 

2 2 

fied  if  normalization  with  respect  to  o and  o is  used.  Let 

x y 


xo  ” i . CT 


and  y = — 

i - ° to  0 

lx  2 y 


(8) 


so  that  region  H is  centered  around  the  origin  and  is  bounded  by  |xo|  < 1 and  |yo|  < 1. 
The  coefficients  of  R(xq,  yQ)  are  chosen  to  provide  a minimum  for  E[R(xo>  y^)].  Co- 
efficients d and  e have  been  eliminated  via  normalization  of  first  order  moments.  Thus, 

2 2 

R(x  , y ) = ax  + by  + cx  y 
o o o 1 o oo 

The  coefficients  of  Eq.  (8)  must  be  chosen  to  satisfy  the  constraints  Equation  (5). 

For  constraint  a 
Let 

x = 0 
o 

Then 


R(o,  yQ)  = byQ  > 


->  b > 1 


|y0i  < 1 


1 Otherwise 


Let 


Y = 0 
o 


Then 


Let 


R(x  , 0)  = ax  > 
o o — 


-*  a > 1 

For  constraint  b 


x = ± 1 
o 


X < 1 

1 o 1 — 


1 Otherwise 


R(±  1,  yo)  = a + byo  ± cyo 


Then 
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where 


B , 


>1/K 


x/K 


2/K 


y/K' 


b2  = TT7TC 


, V«V*  n 

1/K52/K  V K °x/K0y/K; 


Since 


Prob  [correct  classification]^  j (R^  + R^) 
the  probability  of  misclassification  is  bounded  by 


4AJ 


4A_ 


Pel‘-i(RH+  V 


4B~Jb~.4B2 


4A2-[A|2-7Af-4A2  ] 4B22-[B12->2-4b2]' 


In  order  for  P^  to  be  specified  in  terms  of  first  and  second  order  moments,  a functional 
relationship  between  U and  S must  be  found.  Since  it  vs  analytically  very  difficult  to 
find  one,  the  simulation  route  was  followed.  Simulation  under  various  signal  conditions 
led  to  the  functional  forms  of  Figures  2 through  5.  So,  for  low  and  moderate  signal- 
to-noise  rations,  U is  related  to  S as  follows: 

Under  the  hypothesis : U = kj|s|  + k^ 

Under  the  alternative:  U = -kj|S|  + k^ 


and 

Eh[U]  = Mu/h  = kl  ^S/H  + k2 

VarH[u]  = oJ/H  = k,  0 l/H 

CovH[U,S]  = kt  o l/H 

D . kl °S/H  , „ 

P«  ’ kl  °S/H  °S/H  ' 

4‘/H  = °i/H  = °S/K  = *>/K 
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SHIFT-IN -THE -MEAN  SHIFT -IN -THE -MEAN 


Fig.  2.  Three-sample  Mann-Whitney  Fig.  3.  Three-sample  quantile  Mann- 
statistic  versus  shift-in -the  - Whitney  statistic  versus  shift- 

mean.  Signal  under  H.  in-the-mean.  Signal  under  K. 


» » 

u 1 u I 

SIGNAL  UNDER  h SIGNAL  under  K 


CHANGE-OF-SCALE  CHANGE -OF -SCALE 


Fig.  4.  Three -sample  quantile  Mann-  Fig.  5.  Three -sample  quantile  Mann- 

Whitney  statistic  versus  change-  Whitney  statistic  versus  change 

of-scale.  Signal  under  H.  of-scale.  Signal  under  K. 

ek£u1  = Mu/k  = -kiMs/H  + k2 
2 2 

V a rt,  [U 1 = 0,,,,.  = k,  c_  /„ 


5 34 
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CovK[U,S]=  - 

e -U°+1 

au/H 


~kl  PS/K  = 

1 °S/K  °S/K 

U -6 


Thus , 


P < 
e — 


4 / 

\2  ,2  1 
6 l/H+  62/H 

.2 

4pH 

^ l/H  ^ 2/H  V 

*2  £2 

L 6 l/H  62/H  J 

- 72  2 

6 l/H  62/H 

24  i 

rt2  + ?2 
4l/h+^2/H  , 

/T2  72  7~z  n 

l/H  ^ 2/H  4pH 

^ l/H  ^l/H> 

t 2 2 

L 6 l/H  5 2/H  '/ 

T2  T2  T2 — rz~ 

5 l/H  5 2/H  5 l/H6  2/H  J 

T2 
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QUADRATIC  TESTS  IN  M-GRAY  LEVEL  DETECTION 
L.  Kurz  and  P.  Legakis 

The  generalization  of  the  edge  detection  problem  from  two  to  M levels  under 

moderate  and  high  signal-to-noise  ratio  conditions  has  been  treated  by  the  authors 

elsewhere.  * If  images  are  severely  corrupted  by  noise,  as  is  usually  the  case  in 

sonar  and  radar  data,  the  procedures  suggested  in  the  past  do  not  perform  satisfactorily. 

2 

In  this  report,  an  m -interval  quadratic  test  is  used  to  develop  an  efficient  procedure 
of  M-gray  level  detection  of  images  severely  corrupted  by  noise.  A typical  result  of 
implementation  of  the  procedure  to  simulated  data  supports  the  theoretical  results 
presented  below.  Also,  a procedure  for  empirical  establishment  of  decision  thresholds 
is  outlined. 

A.  Quadratic  Tests 

When  the  generalized  signal-to-noise  ratio,  X between  two  hypotheses  and 

H.  , . defined  as 
k+  1 

oe 

\ = / (F(x-pk)  - F(x-nk+1))dF(x-nk)  (1) 

- 00 

is  moderate  or  large,  as  is  the  case  in  most  image  processing  under  noisy  conditions, 
the  statistic  introduced  in  Ref.  1 gives  good  results  and  is  very  easy  to  implement. 
However,  when  X is  small,  it  performs  poorly  and  a different  class  of  statistics  is 
recommended.  Such  is  the  set  of  quadratic  statistics  which  belong  to  the  general 
class  of  nonlinear  m-interval,  M-ary  detection  statistics.  ^ These  tests  are  based 
on  M-interval  equiprobably  partitioning  of  the  probability  space  and  are  of  the  "matched 
filter"  variety.  The  choice  of  nonlinear  over  linear  statistics  is  based  on  the  fact  that 
the  latter  are  suitable  for  stochastic  ordering  type  problems  only  while  the  nonlinear 
statistics  are  good  for  both  stochastic  ordering  and  change-of-scale  problems.  This 
property  makes  them  good  candidates  for  gray  level  and  texture  edge  detection  in  image 
processing  under  severe  noise  conditions.  The  quadratic  statistics  are  formulated 
as  follows: 

Based  on  reference  sample  X^ , X^,  ....  XM>  quantile  vectors  = (akJ , . . ■ 

akM^’  k=  *>  2,  • • • M,  are  selected.  The  selection  is  made  in  such  a way  that  when 

holds,  the  observation  space  is  equiprobably  partitioned.  Thus,  if  n^  is  the  mapping 

of  an  observation  vector  Y^  on  the  partitions  defined  by  and  if  Y^  - F(x  - then 

n,  is  an  accurate  m-interval  representation  of  Y and  the  partition  a.  is  such  that 
k n r k 


(2) 
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In  addition,  since  ^ n.  = n,  the  quadratic  statistic 

j=l  J 


shows  most  atypical  behavior  when  Yr  - F(x  - pR)  in  the  sense  that  it  attains  a minimum 
Thus,  if  Yn  is  mapped  into  every  ak>  k=  1,  2,  3,  . . . M,  and  the  statistics  Tk  are 
formed  according  to  Eq.  (3),  then  the  classification  of  Yn  is  done  on  the  basis  of 


min  (T  j , T2,  ...  Tm> 


For  an  extensive  treatment  of  the  small  signal  behavior  of  the  three-sample  quadratic 
statistic  based  on  quantile  occupancy  vectors,  its  asymptotic  comparison  with  the 
three-sample  Mann-Whitney  statistic.  Ref.  2,  is  recommended. 

B~  An  Empirical  Establishment  of  Classifier  Thresholds  and  Comparison 
of  Test  Statistics  ' — 

It  was  noted  in  Ref.  1 that  if  the  classification  of  a test  sample  is  done  in  parallel, 
M test  statistics  are  generated  simultaneously  and  a decision  is  made  on  the  basis  of 
min  (T  j , T2,  T3,  . . . TM).  Hence,  there  is  no  need  of  classifier  thresholds.  A sequen- 
tial procedure  is  slightly  different  in  the  sense  that  in  order  to  distinguish  between  two 
hypothesis,  Hk  and  Hk+  j a single  statistic  Tk  is  formed  and  a decision  is  made  on  the 
basis  of  whether  TR  exceeds  or  is  less  than  a certain  threshold.  The  usual  measure 
of  comparison  of  statistics  is  through  the  concept  of  asymptotic  relative  efficiency 
(ARE)  which  is  the  ratio  of  efficacies  of  the  statistic.  The  efficacy  <5  of  a statistic  T 
is  given  by  ^ 


<5  = 


Eim 
n -»  « 


H. 


k+1 


• H, 


is  EtV*W 

nVa'r  L'Tk/HkJ 


e=  o 


Where  n is  the  sample  size  and  6 is  the  parameter  that  separates  the  hypothesis  from 
the  alternative.  Notice  that  ft  as  defined  above  has  meaning  only  as  n -» oe  and  9 -+ 0. 

The  observation  to  be  made  is  that  in  both,  the  establishment  of  classifier  thresholds 
and  the  comparison  of  statistics  through  the  concept  of  ARE,  the  results  are  asym- 
ptotically valid.  In  many  applications  one  is  interested  in  neither  extremely  large 
samples,  nor  in  cases  where  the  alternative  approaches  the  hypothesis,  but  rather  in 
moderati  signal-to-noisc  ratios  and  finite  sample  sizes  This  is  particularly  true  in 
image  processing  where  the  signal-to-noise  ratios  are  usually  moderate  to  large  and 
one  would  rather  use  the  single  quantile  statistic  rather  than  the  quadratic  statistic 
which  takes  at  least  five  times  the  processing  time  of  the  linear  single  quantile  statistic. 
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An  empirical  procedure  is  now  suggested  winch  allows  arbitrary  sample  size,  qualita- 
tive and  quantitative  comparison  of  statistics  as  well  as  the  establishment  of  optimal 
thresholds.  The  outline  of  the  procedure  is  as  follows: 

Given  a binary  rectangular  (L  x L ) array  W that  contains  the  same  number  of 
zeros  and  ones,  generate  a new  array  W^where  w^.  takes  on  the  value  of  a test  statistic. 
The  test  statistic  of  the  i-th  row  and  j-th  column  of  Vv  is  based  on  a sample  which  is 
drawn  from  the  hypothesis  or  the  alternative  depending  on  whether  the  w . of  W is  a 
zero  or  a one,  respectively.  Thus  W is  an  estimator  of  W.  Let  L «-  (W  > threshold). 
Then,  L is  an  (L^XL^)  binary  array  consisting  of  elements  w^  such  that 


i . . = 

ij 


if  w.  . > threshold 
ij 


if  w/.  < threshold 
ij 


(4) 


Thus,  the  average  probability  of  correct  classification  is  given  by 


L L 
P 


L L 
p q 
V 


q i=l  j=l 


V (i  . . = w. .) 

L ij  ij' 


where 


(f  ,.c-w..) 
ij  iJ 


1 if  w.  . and  f . . are  equal 
!J  iJ 


0 Otherwise 

and  the  average  probability  or  misclassification  or  error. 


Pg  is  given  by 


p'  = 1 i 

e 1 L L_ 


q i=l  j= 


(f  . . = w.  .) 
ij  iJ 


Once  the  W7  array  has  been  obtained  for  a given  procedure,  the  threshold  can  be  ad- 
justed to  give  the  optimal  settings.  This  procedure  was  tried  for  the  quadratic  quantile 
tests  and  the  results  appear  in  Figures  1 and  2.  The  optimal  threshold  was  obtained 
in  less  than  10  iterations.  By  selecting  well  known  patterns  as  the  array  W one  can 
make  a qualitative  comparison  of  two  procedures  by  printing  images  of  the  reference 
pattern  and  using  the  optimal  threshold  for  each  procedure.  This  is  done  in  Figures 
3 through  5. 
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Fig.  3.  Uncorrupted  binary  reference  pattern . 
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Fig.  4.  Gray  level  detection  based  on  single  quantile 
statistics.  Reference  pattern  corrupted  with 
Gaussian  N(0,  1)  and  N(.  5,  1)  noise. 


xxxmxxxxxxxxxx 

xxxxxxxxxxxxxx 

[XXXXXXXXXXXXXXX 

[XXXXXXXXXXXXXXX 

XXXXXXXXXXXXXXX 

IXXXXXXXXXXXXXXX 

XXXXXXXXXXXXXX 

IXXXXXXXXXXXXXXX 

IXXXXXXXXXMXXXXM 

XXXXXXXXXXXXXX 

XXXXXXXXXXXXXX! 

XXXXXXXXXXXXXX 

XXXXXXXXXXXXXXX 

XXXXXXXXXXXXXX 

xxxxxxxxxxxxxxx 


Fig.  5.  Gray  level  detection  based  on  quadratic  quantile 
statistics.  Binary  levels  of  reference  pattern 
corrupted  with  Gaussian  N(0,  i)  and  N(.  5,  1)  noise. 
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ROBUSTIZED  VECTOR  FORM  OF  GLADYSHEV'S  THEOREM  WITH  APPLICATION 
TO  ESTIMATION  OF  PARAMETERS  IN  A LINEAR  MODEL 

I.  Kadar  and  L.  Kurz 

In  this  report,  the  theory  of  robustized  stochastic  approximations  of  Gladyshev 1 s 
form,  presented  by  the  authors  in  the  scalar  form  in  a recent  report,  1 is  extended  to 
the  vector  case.  The  robustizing  is  accomplished  along  the  lines  suggested  by  Kersten 
and  Kurz.  The  resulting  algorithm  is  then  applied  to  the  important  pioblem  of  para- 
meter estimation  in  a linear  model. 

A.  Vector  Extension  of  Gladyshev's  Theorem  (SAMVLS) 

Consider  the  linear  model  in  its  vector  form  by  having  multiple  observations  of 
y,  yk  available.  Then 

yk=  Xpk+  Vk,  k = 1,  2,  ...  (1) 

The  vector  SAMVLS  formulation  (see  Ref.  1 for  the  scalar  formulation)  for  (1)  becomes 

5*1  = K • <AkA)[xTxPk  - xTyk] 

where.  A,  is  an  (rxr)  diagonal  adaptive  gain  matrix  which  satisfies  the  conditions  of 

k * 

Theorem  1 of  Reference  2. 

Y(p,  P)  = XT(Xp  - y)  = XTX  (P  - P ) + XTV 
and  the  associated  regression  function 

M(P,  p)  = XTX(P-P) 

satisfies  conditions  (iv)  to  (vi),  where  it  is  clear  with  a = o that  M(P,  p)  = a=ohas  a 
unique  root  at  P = p.  The  B matrix  B = X X = lrxr  (by  the  definition  of  the  problem)  is 

a positive  definite  rxr  matrix  s.t.  |]p||  < or  which  is  satisfied  with  X orthogonal  s.t. 

T 

X X = I.  This  makes  P = I and  Ak  a diagonal  matrix  for  each  k. 

The  additive  noise  term 

Z(P)  = XTV 

satisfies  condition  (ii)  with  EZ(p)  = 0.  It  must  have  uniformly  bounded  variance,  as  in 
the  scalar  case,  and  well-defined  covariance  matrix  as  P -*  p.a.s.  Condition  (viia), 
sup  E||Z(p)||  e < oo  for  some  t > 0 is  satisfied  with 


l+£ 


< oo 


r ,vt_ v.  ii+s  rv  2 in 
sup  itr^X  r X) J = |_  2j  Oil  J 

p i=1 

^Conditions  mentioned  hereafter  refer  to  Theorem  1 of  Reference  2. 
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iim  e[z(P)ZT(P)]=  tt  , 

P->P 

where  TT  is  a non-negative  definite  matrix,  is  satisfied  with 

lim  XTrX  = 02[l]rxr  = tt  . 

P-»P 

Condition  (viii)  for  the  adaptive  gain  matrix  is  satisfied.  Let  a > a^^  > . . . > a > 0 
be  the  eigenvalues  of  AR  with  Ak  diagonal.  Similarly,  bj  > b2  > . . . > b > 0 are  eigen- 
values of  B,  which  is  diagonal  and  a^  > a^  > • • • > a^  > 0 are  eigenvalues  of  A,  where 


£ r/wn2!1/* 


P i=1 


< a < oc  wpl  , 


for  k large  and  a^{*)  > aj  > 0 wpl,  where  A is  a constant  matrix  s.t.a'  < 

\r  Jl/2 

L ai  5 a*  Slnce  A 1S  diagonal.  Condition  (ix)  a' b - e > l/2  is  clearly  satisfied 

Li=  1 

with  distinct  nonzero  eigenvalues  of  A and  B,  both  diagonal. 


Theorem 


Under  assumptions  (i)  to  (ix)  stateo  in  Theorem  1 of  Ref.  2 and  shown  to  be 
satisfied  above,  let  r^  > > ...  > rr  > 0 be  eigenvalues  of  AB,  B = X^X  = I . Then 

k (Pk"  P)  *s  asymptotically  normal  with  mean  zero  and  covariance  matrix  Q where 

Q is  a diagonal  matrix  whose  elements  are  a2  O2  [2a..  - l]"1.  For  proof  see  Reference  2. 


B.  Robust  Estimation  of  the  Parameters  in  a Linear  Model 

We  apply  the  theory  established  in  the  previous  section  to  the  estimation  of  the 
parameters  of  a reparameterized,  replicated,  full-rank,  two-way  layout  with  K ob- 
servations per  cell  fixed  effects  experimental  design  model  (i.e.,  a linear  model). 

In  matrix  notation,  the  model  equation  becomes, 


yk  = xPk  + Vk  , k = 1,  2,  ... 


(2) 


where  X could  be,  for  instance,  a (4x3)  orthogonal  design  matrix 


Pk  is  a (3x1)  vector  of  parameters  and  is  a (4x1)  additive  symmetrically  distributed 
noise  vector,  i.i.d  for  each  k. 
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To  robustize  Eq.  (2),  one  introduces  batch  pre-processing  and  a nonparametric 
statistic  (in  a manner  similar  to  Ref.  1 for  the  scalar  case)  applied  to  each  (r=  3) 
component  of 

Y<W  « = xT[x(Pktl  - W * vk+i  j • ■ 

say 

m m 

s™(-)  = Yj  sgn(Zi  + Zj)  + V sgn(Z.)  . I = 1,  2,  3 
i>j  i= 1 

where  S^  ( ) in  this  case  is  a 3x1  vector  operator,  and  the  robust  vector  operator,  and 
the  robust  vector  SAMV LS  is  of  the  form: 

Vl  '?k-jAkSmWk.«  (3) 

S™(  ) for  each  l = 1,  2,  3 is  a symmetric  version  of  WSRNS.  The  use  of  Srn(  ) only 
requires  that  the  class  of  noise  distributions  be  symmetric  and  that  Sm(Y)  satisfy 
conditions  (iv)  to  (vi)  of  Theorem  1 of  Reference  2. 

To  obtain  the  components  of  the  optimum  gain  matrix,  one  must  find  the  appropri- 
ate component  of  the  robustized  regression  function  and  evaluate  the  derivative  at 
p - P in  the  same  manner  as  the  scalar  case.  To  accomplish  this,  define  for  each  r 
(r=  3)  components 

mi-1  m 

=1  l U(Z1+Z  ) + V U(Z.)  , I = 1,  2,  3 
i=  1 j=l  i=l 


where, 

2Uj  ( ) - m(m  + 1 )/ 2 

2U^(  ) - m(m+  l)/2 

2U^(  ) - m(m  + 1 )/ 2 
' — 

Then  it  can  be  shown  by  adapting  results  from  the  scalar  case  to  the  vector  case,  that 
for  each  r=  3 component  of  the  regression  function  with  XTX  = 4[ll 

3x3 


) 

S?<  ) 

II 

S™(  ) 
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EUm  = 


m i - 1 op 

S l J . 


m oc 

fj  (x)  dx  + £ / 

i=  1 


gj(x)dx 


-4(PrPik) 


m i - 1 oc 

S E J 


1=1 '■*  -"‘MW 


r2(x)dx  * V j g2(x)dx 


m i - 1 oc 

EE  J 

,'1,=l  -"W 


f3(x)dx  + £ J 


i“l  -4(P3*P3k) 


gj(x)dx 


where,  (x),  i = I,  2,  3 are  the  components  of  the  pdf  of  XT(Vj  + V.)  and  (x), 

^ ~ It  2,  3 are  the  components  of  the  pdf  of  X V . The  slope  of  the  regression  function 
is  given  by,  for  each  of  the  r - 3 components, 


B = 


m 


‘ mi-1 

2E  £ 4f  (0)  + 2 J 4g  (0) 

i=lj=l 


i=  1 


[m  i - 1 m \ 

2 J £ 4f2(0)  + 2 V 4g2(0) 

i=  1 j=  1 i=  1 J 


m i - 1 m 

2 2 S4£3(°)  + 2 £ 483(°) 

L i=lj=l  i=  1 


and  the  diagonal  elements  of  B become 

af<  = 8^n(m  - l)ff/(0)  + mg/<  (0)]  , <=1,2,  3 


where  fft  (0)  and  g^  (0)  are  the  maximum  values  of  the  pdf's  of  r = 3 components  of  the 
vectors 


’2(Vl  + V2  + V3  + V4) 

’<V1+V2  + V3  + V4> 

2(Vi  + V2  - v3  -V4> 

and 

(Vl+V2-V3-V4’ 

2(V  1 ' V2  + V3  -V4> 

tt  — 

<Vl-V2+V3-V4> 

respectively. 

If  the  pdf  of  Vk  is  known  for  each  k,  then  the  elements  of  the  optimum  gain 
matrix,  A,  in  terms  of  the  's  can  be  computed.  However,  the  symmetric  pdf  of 
the  i.i.d.  vector  sequence,  Vk»  may  not  be  known.  This  means  that  we  have  to  find 
an  estimator  of  both  f^(0)  and  (0),  f = 1,  2,  3 in  order  to  find  the  elements  of  the 
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adaptive  gain  matrix.  To  this  end,  we  represent  the  unknown  class  of  cdf's  by  a 
vector  extension  of  the  generalized  Gaussian  noise 

3 

gc(x)  = ir  gNx) 

/ =1  1 


where  g C(x)=  [c/2A(c ) T ( l/c)  ] exp  { - | x | /A(c ) ]C  ] , i = 1 , 2,  3 A(c)  = [ c ] T ( l/c)  T (3/c)  ] 

c x Z * 

and  the  covariance  of  g (x)  is  diagonal  with  elements  c , l = 1,  2,  3.  The  range  of 

values  of  g^(0),  t = 1,  2,  3 between  I < c < 4 (which  represent  a large  class  of  CDF's) 

is  only  2:1.  This  means  that  the  diagonal  elements  of  can  be  approximated  as 

st  8m2gu  (0)  , 1=1,  2,  3. 


Now  to  obtain  the  diagonal  elements  of  the  adaptive  gain  matrix,  a robust  esti- 
2 - 1 1 Z 

mator  of  [8m  (0)]  is  formed  ’ which,  in  terms  of  the  elements  of  A^(  ) is 

given  by 


k-1 

m + 1 v 

8(k-  1)  L 
j=l 


,n 


[5=1+1 


- z 


tl 

-j.  q ] 


/ = 1,  2,  3 


where 


3’  C™]+  1 


= [*^1+  1-th  order  statistic  from  the  component  of  random  samples 


Yi+m(k-i)’  *■  ” m ^ 1 with  pdf  g(x)  = 7 T g ^ (x),  where  ] is  defined  to  be 

the  greatest  integer  less  than  or  equal  to^  . 


Reflecting  back  at  this  point  to  the  requirements  of  Theorem  1 of  Ref.  2,  we 
see,  from  condition  (vi)  that  P = [l]3xj  with  B = 4 [I l3jc3 . The  additive  noise  term 
Z(Pk)  = SmY(  ) -M(  ) has  been  shown  to  satisfy  condition  (vii)  for  the  same  functional 
form;  and  the  adaptive  diagonal  gain  matrix  A^(  . ) with  B = 4[l]^^  has  also  been 
shown  to  satisfy  condition  (viii)  in  Ref.  2 for  a similar  form  of  the  batch-order 
statistic  estimator  of  the  elements  of  Ak(  • ).  Thus,  a flexible  robustized  algorithm 
for  estimating  the  parameters  of  the  linear  model  has  been  generated.  Applications 
of  this  algorithm  to  problems  of  image  processing,  pattern  recognition,  radar  and 
sonar  data  processing  are  the  subject  for  future  investigation. 
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“Z^al  investigations  involving  basic  measurements,  development  of  devices, 
and  materials.  

Each  activity  i\eports  in  summary  fashion  on  specific  results  obtained 
during  the  report  period,  IS  September  1977  through  14  September  1978, 
with  individual  acknowledgement  of  the  sponsorship  which  has  contributed 
to  the  reported  work.'^The  report  is  organized  into  two  major  divisions. 

The  first.  Electrophysics,  includes  the  topics  of:  Electromagnetics; 
Acoustics;  Optics;  Quantum  Electronics;  Solid  State  and  Materials;  Wave- 
Matter  Interactions;  and  Electric  Power  Engineering.  The  second.  Sys- 
tems, includes  the  topics  of:  Communications;  Computer  and  Computer- 
Communications  Networks;  Safely,  Reliability  and  Software  Engineering; 
Systems,  Control  and  Networks;  and  Data  Processing. 
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